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Message from the President 


Setsuho IKEHATA (President, Tokyo University of Foreign Studies) 


The 21st Century COE (“Center of Excellence”) Program, launched by 
the Ministry of Education, Sports, Culture, Science and Technology in 2002, 
grants subsidies to distinguished universities in our country for the 
establishment of a center of research and education in various fields with the 
highest academic standards in the world. It aims at raising the level of 
research in our country’s universities and fostering creative academic minds, 
expected to become leaders of the world. Tokyo University of Foreign 
Studies (TUFS) submitted applications for research projects in two of the 
selected programs—the Humanities and Interdisciplinary/Compound/New 
Sphere fields. We have obtained wonderful results; both projects were 
selected. We are extremely pleased and encouraged by this high evaluation 
of the unique research projects and educational potential of our Graduate 
School of Area and Culture Studies. 

To run the program, TUFS has accepted outstanding experts from all 
over the world—Europe, North and South America, Oceania, Asia, and 
Africa—who collaborate on education and research in a wide range of 
academic fields including linguistics, literature, history, philosophy, cultural 
anthropology, sociology, political science, and economics. Thus, we have 
attained an extremely consistent interdisciplinary and comprehensive 
approach for a single-faculty university. In an age that emphasizes the global 
community, it is certainly desirable for us to maximize and further develop 
this unique strength in both education and research. 

A strong foundation in foreign languages is vital to area and culture 
studies. TUFS engages in education and research in over 50 languages, 
cultures and societies in every part of the world, which contributes to 
cross-cultural understanding and the development of people capable of 
contributing to the actualization of a harmonious global community. In 
addition, a double-major system that requires students to specialize in both a 
language and a discipline-related course of study enables TUFS to produce 
graduates equipped with a high degree of language competence and a deep 
knowledge of world cultures and societies. 

Our new campus in Fuchu is proudly equipped with the state-of-the-art 
computing network. The most outstanding feature is the information literacy 
and the number of computers on campus, which ranks at the top level among 
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liberal arts universities in our country. With such priviledged information 
infrastructure, TUFS endeavors to make the best use of multimedia, the 
internet and other devices, in order to develop the most advanced language 
education. 

The University’s Usage-Based Linguistic Informatics project, selected 
by the 21st Century COE Program, is the concrete manifestation of our plans 
for the future, which I have just mentioned. The implementation team 
members are committed to this future vision and vigorously engaged in the 
project. It is my fervent desire that they will produce rewarding results. It is 
the intention of everyone at TUFS to combine our wisdom in a concerted 
effort to do our utmost to make a success of the 21st Century COE Program. 

With a view to providing full support to the program, TUFS has 
established the “21st Century COE Program Administration Office” which is 
directly responsible to myself, the President. This Office is an inter-sectional 
organization consisted of the President, the Vice-President, the deans of each 
division, the Program Leader, as well as the managers of the secretariat. Its 
important role is to enhance the cooperation between the various sections 
within TUFS and to administrate the use of the space and the budget 
allocated for research. 


Tokyo, December, 2005 


wy 
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Center of Usage-Based Linguistic 
Informatics (UBLI) 


Yuji KAWAGUCHI (COE Program Leader, Tokyo University of Foreign Studies) 


1. Linguistic Informatics 

It is widely believed that linguistic theories and computer sciences have 
greatly influenced foreign language education, however the collaboration of 
these three domains has not brought about new scientific results. The present 
program aims to meet such a scientific need. An overall integration of 
Theoretical and Applied Linguistics will be realized on the basis of 
Computer Sciences. We have named this synthetic field Linguistic 
Informatics. When we hear this name for the first time, we may take it for a 
branch of natural sciences. However, since our language represents a system 
of information, linguistics itself constitutes, in a broad sense, a part of 
informatics. In the following lines, due to the limitation of space, I will 
explain only the essence of this 21st Century COE (Center of Excellence) 
Program. 


COE Program Promoters 


Yuji KAWAGUCHI French and Turkish Linguistics 
Susumu ZAIMA German Linguistics 

Nobuo TOMIMORI Romance Linguistics 
Toshihiro TAKAGAKI Spanish Linguistics 

Yoichiro TSURUGA French Linguistics 

Ikuo KAMEYAMA Russian Literature 

Akira MIZUBAYASHI French Literature, History 
Hideki NOMA Korean Linguistics 

Kohji SHIBANO Information Technology 
Makoto MINEGISHI Theoretical Linguistics 
Mayumi USAMI Social Psychology of Language 


2. Organization and Research Projects 

The present COE program is directed by the following supervisors: 
Susumu ZAIMA, Toshihiro TAKAGAKI, Yoichiro TSURUGA, Kobhji 
SHIBANO, Makoto MINEGISHI, Mayumi USAMI and Yuji KAWAGUCHI. 
In 2004-2005, the following research projects are undertaken in three 
scientific fields respectively. 
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4 Yuji KAWAGUCHI 


Research Projects in 2004-2005 


LINGUISTICS: 


Corpus-based analysis of sentence structures and cross-linguistic perspectives in phonetics 


Yoichiro TSURUGA Impersonal constructions in French 
Yuji KAWAGUCHI Diachronic research on negative constructions in French 
Naotoshi KUROSAWA Word order of modifier and modified constituent in Latin 


and Portuguese 


Kiyoko SOHMIYA 


Aspects of marked constructions as seen in corpora 


Kazuyuki URATA Diachronic research on the subjunctive in English 
Susumu ZAIMA, oe 
Takashi NARITA Corpus-based research on verb construction in German 
Toshihiro TAKAGAKI Construction of a Spanish corpus and the development of 
relevant tools to advance Spanish language research 
Hidehiko NAKAZAWA _| Utilization of a corpus for research on Russian verbs 
Takayuki MIYAKE Research on the syntactic characteristics of Chinese verbs 
based on corpus analysis 
Comparative study of compound verbs in Japanese and 
Keiko MOCHIZUKI Chinese that express “causal phenomena and resultant 
phenomena” and their corresponding English sentence 
structures 
Shinjiro KAZAMA Descriptive study of grammar using spoken and literary 


corpora 


Isamu SHOHO 


The causes and results of marked word order in the 
Malaysian language 


Yoshio SAITO 


Satoko YOSHIE Construction of a Wakhi vocabulary corpus 
Shinji YAMAMOTO The Italian language in the 21* century 
Yuji KAWAGUCHI, 
Tsunekazu MORIGUCHI, 
Nobuo TOMIMORI, Prosodic analysis of speech through the TUFS Dialogue 
Hiroko SAITO, Module 
Masashi FURIHATA, 


APPLIED LINGUISTICS : 


Discourse Analysis, Second Language Acquisition, Evaluation of TUFS Language Modules 


Construction and analysis of a multilingual corpus of spoken 


language 
Mayumi USAMI Basic Tesearch on methodology for natural conversation 
analysis 
Development of a basic transcription system for Japanese, 
Korean, Chinese and English. 
Construction and analysis of Japanese learner-language 
Tae UMINO oa 


Basic research aimed at the development of learner’s 
manual for ‘Japanese Dialogue Module’ 
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Asako YOSHITOMI 


Construction of an English learner language corpus 


Revision of the English Dialogue Module teacher's manual 


Masashi NEGISHI, 


Hideyuki TAKASHIMA, | Development of a Language Proficiency Scale 
Masanori ICHIKAWA, __| Assessment of TUFS Language Modules 


Koyo YAMAMORI 


COMPUTER SCIENCES: 
E-learning, Natural Language Processing 


Hiroshi SANO 
Chun Chen LIN 


language education 
Construction of e-learning system 


Construction of an educational material corpus for Japanese 


In principle, these projects are considered as fundamental research for 
the development of TUFS Language Modules, which are the very fruits of 
Linguistic Informatics and the significant scientific contribution of this COE. 


3. TUFS Language Modules 

3.1. Cohabitation of Natural Language and Machine Language 
Our main objective is to innovate foreign language education by 

developing superior educational materials and delivering them over the 

Internet. At present, the following 17 languages are covered in the TUFS 


Language Modules. 

Editors of Pronunciation, Dialogue and Grammar Modules 

Cross-Linguistic | Makoto MINEGISHI 

Grammar 

English Keizo NOMURA, Hiroko SAITO, Kazuyuki URATA, 
Asako YOSHITOMI 

German Takashi NARITA, Akiko MASAKI 

French Yuji KAWAGUCHI, Akira MIZUBAYASHI 

Spanish Shigenobu KAWAKAMI, Toshihiro TAKAGAKI 

Portuguese Naotoshi KUROSAWA, Chika TAKEDA 

Russian Hidehiko NAKAZAWA 

Chinese Kazuyuki HIRAI, Takayuki MIYAKE 

Korean Eui-sung CHO, Koichi IKARASHI, Hideki NOMA 

Mongolian Yoshio SAITO, Renzo NUKUSHINA, Hideyuki OKADA 

Indonesian Masashi FURIHATA 

Filipino Tsunekazu MORIGUCHI, Michiko YAMASHITA 

Lao Reiko SUZUKI 

Cambodian Hiromi UEDA, Tomoko OKADA 

Vietnamese Yoshio UNE, Hiroki TAHARA 

Arabic Robert RATCLIFFE 

Turkish Mutsumi SUGAHARA 

Japanese Futoshi KAWAMURA, Yumiko SATO, Tae UMINO 
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6 Yuji KAWAGUCHI 


This is a large-scale project that includes more than 100 researchers and 
graduate students. In the TUFS Language Modules, the multilingual 
language learning system is one of the main features. In fact, we teach more 
than 40 different languages at TUFS. But the novelty of the TUFS Language 
Modules lies in another fact. For example, 17 languages are described in 
unicode (UTF-8), and in our system, HTML, a basic language of the World 
Wide Web (WWW), is correlated with XML, which was first invented in 
1998 and has recently begun to be applied on the WWW. 

This project also has educational ends for graduate students, who 
undertake the role of preparing the primary materials for the structuring of 
the modules. Through this research activity, they will gain the knowledge not 
only of Linguistics and Applied Linguistics, but also of Computer Sciences. 
In this way, the graduate program will foster new types of linguistic 
researchers — those who have full knowledge of Theoretical and Applied 
Linguistics and who can manipulate a computer-assisted language learning 
system. 


3.2. Modularized View of Language 

With the advent of the Internet, we have become conscious of the 
omnipresence of information, that is, what we call ubiquity of information. 
On the other hand, the WWW gives us an oppurtunity to re-think over how 
and what that information should be. On the WWW, theoretically speaking, 
infinite ordering and a combination of information are possible through their 
mutual linkages. In the TUFS Language Modules, we set our way of thinking 
free from a traditional view of language and adopt a modularized view of 
language. Each language unit is composed of four relatively independent 
modules, namely, pronunciation, dialogue, grammar and vocabulary modules. 
The idea of module components allows learners and teachers to learn and 
teach the target language from whichever part of the modules and in 
whatever order. 


3.3. Cross-Linguistic Syllabus 

These modules will promise more freedom than ever to learners and 
teachers; however, a common measure is indispensable for the evaluation of 
language learning and education. In this sense, the evaluation of modules is 
very important for this COE program. As each module is designed 
independently, one may evaluate it individually. But as far as educational 
contents and goals are concerned, a more or less loose unity has been 
realized by adopting a common syllabus design for 17 different languages, so 
that in addition to a traditional analysis of learners’ idiosyncratic characters, 
one can make an interesting contrastive analysis of individual or universal 
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characteristics of second language acquisition (SLA) through 17 different 
languages. Therefore, the Cross-linguistic syllabus is regarded as an 
innovation in this web-based language education system. 


3.4. Linguistic Usage 

The process of developing the TUFS Language Modules is as follows: 1. 
Creation of language materials; 2. Implementation on the WWW; and 3. 
Web-based language education. The first step is to create language materials 
which are appropriate for the language modules. We must ask ourselves 
about the kind of language materials that should be provided. Our conclusion 
is that these language materials should be “usage-based”. The key concept 
here is linguistic usage. Next, we must consider what exactly does the term 
usage mean. The term is highly polysemous. Some researchers claim that 
linguistic usage becomes explicit only through quantitative analysis of an 
enormous corpus. Others argue that usage should be fixed on mutual speech 
acts between a speaker and a listener. Moreover, some may suppose that 
linguistic usage is related to our cognition, for our linguistic knowledge will 
be accumulated through the encounter with new linguistic usages. We also 
find researchers who inisist on the interaction between linguistic and 
extra-linguistic aspects of linguistic usage. In short, the definition of usage is 
not at all unanimous among linguists. The TUFS Language Modules provide 
us with an opportunity to reconsider the significance of usage for linguistic 
research and language education. 

The pronunciation and dialogue modules became available on the 
Internet at the beginning of 2004. At the end of Academic Year 2005, the 
grammar module also became available for 10 different languages. The 
development of vocabulary module is underway. 


4. First International Conference on Linguistic Informatics 

Immediately after the selection of this COE program by the Ministry of 
Education, Culture, Sports, Science and Technology, we undertook the 
preparation of the first conference on Linguistic Informatics. At the end of 
2002, the outline was fixed. On December 13 and 14, the first International 
Conference on Linguistic Informatics was scheduled at TUFS. The 
conference consisted of three sessions: 1. Computer-Assisted Linguistics; 2. 
Corpus Linguistics; and 3. Applied Linguistics. 

It was a great honour for me to organize this international conference, 
which was pariticpated by many guest speakers not only from other 
universities in Japan but also from all over the world. Many graduate 
students, mostly PhD candidates, also presented their papers in the 
conference. Unlike most conferences, proceedings were published before 
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the conference since the conference covered large scientific fields, ranging 
from Computer Linguistics, Philology, Dialectology, Corpus Linguistics, 
Discourse Pragmatics, Applied Linguistics to e-learning, and without the 
prepublished proceedings, the conference participants would not have been 
able to understand the essence of the contributions or to follow the 
discussions. 

Through this conference, we hope to have explained the concept of 
Linguistic Informatics, state of the art field, and presented the challenges this 
field must solve in the future. We hope that this synthesis of different 
scientific fields produces fruitful outcomes and help us to gain insights into 
the futurevision of this science. 


cf. TUFS Language Modules (Japanese version) : 
http://www.coelang.tufs.ac.jp/modules/ 
TUFS Language Modules (Multilingual version) : 
http://www.coelang.tufs.ac.jp/english/modules/ 
Usage-Based Linguistic Informatics: 
http://www.coelang.tufs.ac.jp/ (in Japanese) 
http://www.coelang.tufs.ac.jp/english/index.html (in English) 


In the spring of 2005, the John Benjamins Publishing Company of the 
Netherlands published a series of reports from The First International 
Conference on Linguistic Informatics as well as the results of corpus 
linguistic research as the first and second volumes of the series, 
Usage-Based Linguistic Informatics. 


Usoge-Based Lingwistic Informatics 2 


Usage- Based Linguistic leformatics 1 
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5. Second International Conference on Linguistic Informatics and 
Workshop on Spoken Language Corpora 

On December 9, 2005, a joint workshop entitled Spoken Language 
Corpus - Significance and Its Application - by C-ORAL-ROM and 
Usage-Based Linguistic Informatics (UBLI) was held at TUFS. 
C-ORAL-ROM is a joint project on the spoken language corpora of 
Romance languages collaboratively compiled by the University of Florence 
in Italy, the Autonomous University of Madrid in Spain, the University of 
Aix-Marseille I in France, and the University of Lisbon in Portugal. 

On December 10, the following day, The Second International 
Conference on Linguistic Informatics—Contributions of Linguistics, Applied 
Linguistics, and Computer Science, was held at the TUFS Multimedia Hall. 
The rationale for linguistic informatics was identified and the academic 
position of this field was discussed in various lectures on linguistic analysis 
of spoken language, pragmatic analysis, second-language acquisition, and 
corpus linguistics. 
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Preface 


Asako YOSHITOMI, Tae UMINO and Masashi NEGISHI (Tokyo University 
of Foreign Studies) 


This book is the fourth volume of a series on Usage-Based Linguistic 
Informatics, a product of an ongoing large-scale project, the 21° century 
COE program, conducted at Tokyo University of Foreign Studies. The 
project attempts to establish a new field of study that integrates current 
developments in theoretical linguistics, applied linguistics, and computer 
sciences. On the practical side, the project has been developing e-learning 
materials in 17 foreign languages. On the more research-oriented side, the 
project has encouraged explorations of the newly proposed field of 
Linguistic Informatics in addition to enhancing progress in the three 
constituent fields of study. 

This volume is a collection of papers that responds to the latter aim of 
the research-oriented side of the project, namely, to facilitate applied 
linguistic studies. With the contributions of scholars from various institutes 
around the world, we have been able to put together an exciting collection of 
readings on a wide range of current issues debated in applied linguistics with 
a particular focus on Japan. 

The editors of this volume express their sincere gratitude to the project 
leader, Yuji Kawaguchi, for providing the opportunity, as well as moral and 
financial support, to publish this book. 


December 2005 
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An overview of the chapters in this volume 

This volume consists of two parts: Part I is a collection of papers on 
second language pedagogy, and Part II on second language acquisition. 
Although most papers cannot be strictly categorized as either dealing with 
language teaching or language learning alone, they have been placed in 
either section according to the stronger emphasis they have. 

Starting Part I is a paper on discourse politeness theory and 
cross-cultural pragmatics by Mayumi Usami. In this paper, she introduces a 
framework of Discourse Politeness Theory (DPT), and discusses how this 
theory can contribute to finding ways to solve problems created by the 
transfer of politeness strategies from a learner’s first language to a second 
language in cross-cultural interaction. This theory has been developed based 
on the results of a series of empirical studies, and attempts to enable 
researchers to contrast politeness behavior in languages with and without 
honorifics within the same framework, while minimizing cultural biases. 
Usami’s proposal also aims to broaden politeness research to encompass the 
concept of relative politeness in addition to absolute politeness studied thus 
far in the field of pragmatics. She claims that in order to construct a 
“universal theory of Discourse Politeness,” politeness should be captured 
both from the perspectives of the speaker’s politeness strategies and the 
hearer’s interpretations of the speaker’s strategies. The concept of 
discrepancy between the speaker’s and hearer’s estimation of the degree of 
Face Threat is therefore integrated into this theory. Furthermore, the theory 
includes the concept of DP default which represents the prototypical 
behaviors constituting a particular discourse. DPT and second language 
acquisition or language learning are closely related to each other, because in 
cross-cultural contact, language learners have to identify the DP defaults of 
the target language and culture, and learn them to achieve smooth 
communication with others in the target language and culture. Usami’s 
paper explains the relationship between DPT and cross-cultural pragmatics, 
using examples from cross-cultural studies that can be explicated within the 
framework of DPT. She then discusses the ways in which this theory can 
contribute to solving problems created by the transfer of politeness strategies, 
and suggests ways in which language teachers can integrate the perspective 
of DPT into their teaching. 

In the second paper of Part I, Takashi Suzuki discusses and attempts to 
demonstrate some benefits of teaching and practicing conversational 
storytelling skills in English classes in Japan. Conversational stories (CSs) 
are a discourse genre often found in English casual conversation, and have 
been found in recent years to serve crucial interpersonal functions. CSs, 
however, have not been taught or practiced extensively in English 
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classrooms in Japan, where language for more tangible and/or institutional 
purposes has tended to be given more emphasis. Suzuki argues that 
teaching and practicing CS-telling in English classes will greatly benefit 
learners because 1) the ability to tell CSs successfully will help learners 
develop and maintain social relationships in and outside of the classroom, 
and 2) some characteristics of CSs, such as their goal-orientation and 
necessity for negotiation, among others, make CS-telling a particularly 
effective vehicle for language learning. To demonstrate these advantages of 
CS-telling as a learning activity, transcripts of some actual CSs are shown 
which were produced in conversations among Japanese learners of English. 
In the third paper, Hide Takashima and Rie Sugiura propose an 
integration of theory and practice in grammar teaching. In the last few 
decades, language teaching has seen a great shift from traditional 
grammar-oriented methods to more communication-oriented ways of 
teaching. Findings of theoretical and empirical research have indicated the 
importance of incorporating communicative perspectives into language 
teaching, regardless of whether the context is English as a Second Language 
(ESL), such as in the US, or English as a Foreign Language (EFL), such as 
in Japan. Many language teachers in Japan, acknowledging the necessity of 
communicative ways of teaching and following the Course of Study 
specified by the Ministry of Education, have begun to employ more 
communicative activities in their classrooms. Yet it is still the case that 
most students cannot use English effectively once they go out of the 
classroom, and some cannot do so even in the classroom. Takashima and 
Sugiura propose two changes aimed at solving, at least in part, the inert 
knowledge problem commonly observed in Japan, especially in public junior 
high schools. The first is to view grammar teaching not from a perspective 
of knowledge transmission but from one of skill development. In this view 
of grammar, the object of assessment is not whether the learners know a set 
of rules but whether they can use language appropriately, accurately and 
meaningfully according to the context. In other words, the object of 
assessment in this view should be the product of the “grammaring,” that is, 
the learners’ deployment of grammar skills that have been taught. The 
second proposal they make is to introduce activities called “task activities” 
(TAs), which they believe to play a crucial role, when used efficiently, in 
facilitating the development of language acquisition skills. TAs are 
designed to induce learners at some point to use one particular structure 
selected from among other similar structures to best suit the required context, 
while keeping their primary focus on the message and completion of the 
activity. The introduction of such activities is of paramount importance 
especially in a Japanese context where more input, interaction, negotiation of 
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meaning and feedback are essential in promoting grammatical accuracy, as 
well as in bringing in more opportunities to use English in the classroom. 
The paper concludes that, in EFL conditions, by employing TAs, not Tasks, 
teachers can scaffold their learners’ skill in appropriate usage of grammar 
and provide them with pinpointed feedback, while leaving the primary focus 
on message conveyance at the same time. 

The next two papers deal with language testing. Lynne Hansen and 
Joshua Rowe introduce a computerized speaking test that they have 
developed. The assessment method described in the paper grew out of 
investigations of temporality in speech, and studies of applications of speech 
recognition technology to the evaluation of oral proficiency in a second 
language. Drawing upon previous work, Hansen and Rowe have developed 
an instrument for the elicitation and analysis of speech and conducted a 
successful pilot study in a university ESL program in Hawaii. The test scores 
related significantly to students’ placement level in their English speaking 
classes. Scores automatically calculated for SPEAK test speech samples 
correlated significantly with ratings of the samples by trained human raters. 
The pilot data indicate that the computerized assessment method offers an 
economical test that is predictive of the speaking ability of EFL learners. 

Masashi Negishi also deals with language testing in EFL. The last few 
decades have witnessed the mushrooming of proficiency scales to measure 
foreign language ability. Negishi points out that although these scales are 
designed to provide people with a “common” framework of reference in its 
intended region, they are in fact, “local” from the global perspective. 
Hence, at the moment, it may not be easy to apply one framework to a 
different context. The paper reports the process he used to develop a 
proficiency scale for Japanese high school students based on a 
well-established English proficiency test (GTEC for STUDENTS) and then 
to relate this scale to the Common European Framework (CEFR) in order to 
see its universality and particularity as a language proficiency framework. 
These two scales were developed independently, in totally different contexts. 
However, it was shown that there is a close correspondence between the 
GTEC scale and the CEFR. This seems to indicate that they share a common 
foundation for language development and reinforce each framework’s 
validity. Negishi notes that each framework’s specific needs and contexts 
have produced different angles and different emphasis on aspects of 
language ability. 

The next paper by Alison Stewart addresses the topic of language 
teacher expertise from a social perspective. In the fields of teacher 
knowledge and teacher education, expertise is a quality that has, until 
recently, been seen as the preserve of the individual alone. Recent research, 
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however, has focused attention on the social context in which teachers 
practice their professional skills. The paper examines the ways in which 
social conditions affect teachers’ ability to practice their craft to the best of 
their ability. In a narrative interview study, two experienced EFL teachers 
working at Japanese universities discuss their own and others’ evaluation of 
their teaching skill, compare their teaching with those of others in their 
institution, and comment on opportunities for interaction and challenge 
within the teaching community. Despite the significant differences in the 
teachers’ experiences, Stewart finds that both accounts reveal a common 
insight that, more than individual knowledge, skill or performance, teacher 
expertise must be based on recognition and validation by both students and 
the teaching community to which the practitioner belongs. 

The remaining six papers of the volume discuss second language from a 
learning perspective. Barbara Pizziconi’s paper examines how the 
cognitive transformation necessary to the acquisition of benefactive 
structures is assessed and portrayed by Japanese language learners. 
Acquisition is seen here as a process involving not only increased ease of 
manipulation of linguistic structures (due to extended exposure), but also to 
acculturation to Japanese native metaphors and conceptual framing (due to 
participation in social activities). Learners’ metalinguistic comments about 
the workings of the benefactive verbs and the learning process which they 
are experiencing are critiqued from the viewpoint of the relation between 
what they portray to be their patterns of use or avoidance, and their actual 
performance, measured in quantitative terms as actual frequency of use in 
recorded conversations. Pizziconi explains the gap between perceived and 
actual performance in terms of the linguistic ideologies based on which 
learners construe their learning persona. Learners are shown to be very 
sensitive to the literal meaning of these verbs; crucially, the literal meaning 
of “favor transfer” is almost invariably felt to be alien to the learners’ own 
conceptual frame, and its meaning is often linked to the wider conceptual 
categories of honorification and politeness — a semantic fallacy which may 
be induced or at least allowed by teaching conventions. Frequency of use 
seems to increase with general proficiency and to be associated to an 
enhanced awareness of the less literal meanings of the verbs. In other words, 
fluent use involves a reconceptualization of such verbs as grammatical 
devices which imply, but do not necessarily foreground, a benefactive 
interpretation. Moreover, productive or creative use presupposes a speaker’s 
emancipation from the use of benefactives with a strictly literal meaning of 
“favor transfer.” Pizziconi holds the role of intersubjective practices — and 
by implication, of residence in the target culture - to be vital to the 
development of grammatical as well as pragmatic competence: novel 
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formulations of conceptual categories such as that of “social indebtedness” 
and “positive/negative affect,” which speakers of Japanese must consistently 
attend to, are seen as framing conventions to which learners are gradually led 
through extended exposure and participation in socially meaningful 
interaction, and in spite of apparently resisting cultural beliefs. The paper 
makes the claim that pervasiveness of benefactive structures as grammatical 
devices allows the metaphor of indebtedness as rhetorical device to 
“infiltrate” native framing conventions and become gradually more salient, 
to the point of being exploited with even higher frequency than in native 
speaker data. 

The next paper by Amy Snyder Ohta addresses Vygotsky’s Zone of 
Proximal Development (ZPD), which is increasingly being applied to second 
language acquisition contexts as a tool to help understand how languages are 
learned through social interaction. The paper considers how the ZPD, 
developed by Vygotsky to explain the development of children, can be 
applied to adult language learning. While Vygotsky considered assistance 
in the ZPD to come via direct interaction with an adult or more expert peer, 
for the adult learner, assistance can be obtained not only through interaction 
with another person, but also by interfacing with, for example, books, course 
notes, online helps, and video. Ohta proposes that for the adult learner, the 
ZPD applies beyond the interpersonal setting. Supporting evidence is 
provided from interviews with five advanced learners of Asian languages. 
While for children, the ZPD may exist only in interpersonal settings, 
interview results support the notion that ZPDs are also formed as adult 
learners interface with non-human sources of assistance. Her findings 
suggest that the ZPD itself may be internalized through the process of 
cognitive development. In this way, adults can organize their learning so as 
create ZPDs and can become skilled at managing their ZPDs, as well. Asa 
result, adults can take good advantage of the ZPD in a broad range of 
settings they seek and receive assistance interpersonally or via other media. 

The following two papers both looks at the narrative performance by L2 
learners. Keiko Yoshioka’s paper examines how gestures in L2 discourse 
reflect the interaction between L1 and L2. Specifically, she examined the 
way Dutch intermediate learners of Japanese manually introduce animate 
referents in L2 with L1 Dutch and L1 Japanese as baseline. Quantitative and 
qualitative analysis of gesture production in L2 shows that learners manually 
mark the introduction of animate referents in a manner that is different from 
the norms of L1 Dutch or L1 Japanese. Furthermore, cross-linguistic analysis 
reveals variation in gestural performance between the two native groups. The 
results are discussed in terms of cross-linguistic variation in gesture 
production, and how the interaction between learners’ knowledge of L1 and 
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L2 is reflected in the manual introduction of referents in L2 discourse. 

The paper by Asako Yoshitomi focuses on the use of phrasal verbs in 
narratives by Japanese learners of English. There has been increasing 
recognition of the role that formulaic sequences play in language use and 
language development. The paper focuses on one type of formulaic 
sequence, English phrasal verbs, which have been proven difficult to acquire 
even by advanced learners of English. A comparison is made in the use of 
phrasal verbs in a story telling task between adult EFL learners and ESL 
learners. Since formulaic sequences including phrasal verbs are believed to 
be learned primarily through abundant exposure to natural linguistic input 
and through the use of language in interaction, it was hypothesized that ESL 
learners would use more phrasal verbs and do so in a more native-like 
manner than EFL learners at the same proficiency level when telling the 
same story in English. However, analysis of speech data indicates that even 
among the advanced learners, phrasal verbs are significantly underused by 
both groups of learners compared to native speaker speech despite the fact 
that the ESL learners under study all had three years or more of residential 
experience in English-speaking countries. Retrospective data obtained 
from the learners revealed that learners are to a large extent aware of their 
lack of narrative competence. They also reported avoiding many 
expressions due to deficient vocabulary knowledge. Given the significance 
of the use of formulaic sequences in enhancing fluency and possibly 
contributing to language development, Yoshitomi discusses possible 
theoretical, methodological, and pedagogical considerations for future 
research along this line of investigation. 

The final two papers of the volume take up the use of audiovisual 
materials and computers as tools for enhancing second language learning. 
Tae Umino investigates strategies which learners of a second language use 
for learning from audiovisual media in self-instruction. Despite the amount 
of attention and effort given to the development of audiovisual materials for 
use in self-access self-instruction, there are very few studies which address 
the extent to which learners comprehend the content provided in the 
materials and/or what they do in order to learn effectively from the materials 
in these modes. In order to identify the problems learners face in learning 
with such materials and the strategies for overcoming these problems and 
making the most of the media characteristics, Umino carried out a 
longitudinal investigation of the self-instructional process of seven adults 
learning Japanese using audiovisual media over fourteen months. The 
paper reports the results from the second period of data collection and 
provides a description of strategies used for audiovisual comprehension as 
well as their development over the seven-week period. Based on the 
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findings, Umino discusses strategies which are useful for L2 learning with 
audiovisual media in general as well as implications for materials 
development and methodology for investigating learner strategies more 
generally. 

Finally, the paper by Mark Peterson examines the application and 
impact of network computer technologies in the field of CALL. The rapid 
expansion in the use of network-based tools in language education brings 
forth new opportunities to engage learners in activities that foster L2 
development. Peterson discusses the application of both synchronous and 
asynchronous communication tools in learner-based projects in order to 
identify the aspects of computer-mediated communication that may have a 
role to play in supporting second language development in the online domain. 
The paper also focuses on identifying areas of potential for future research. 
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Discourse Politeness Theory 
and Cross-Cultural Pragmatics: 


Mayumi USAMI 


1. Introduction 

In this paper, I introduce a preliminary framework for a Discourse 
Politeness Theory (hereafter, DPT), which has been developed based on the 
results of a series of empirical studies on discourse behavior (for a review, 
see Usami, 1993d, 1998b, 2001a, 2002a, 2002b). This approach is an 
attempt to enable researchers to contrast politeness behavior in different 
languages with and without honorifics within the same framework while 
minimizing cultural biases and develop a more comprehensive universal 
theory of politeness at the discourse level. This proposal also aims to 
broaden politeness research to encompass the concept of relative politeness 
in addition to absolute politeness, which has thus far been studied within the 
field of pragmatics (for example, Leech, 1983). This is because the notion 
of relative politeness permits the construction of a universal theory of 
Discourse Politeness (DP) as both a system of the principles of motivations 
that induce politeness strategies and a system of the interpretations of 
politeness in verbal interactions. 

The DPT and second language acquisition are closely related. This is 
because in cross-cultural interactions, language learners have to identify the 
DP defaults (see 3.1.2) of the target language and culture and learn these in 
order to achieve smooth communication with others from the target language 
and culture. In this paper, therefore, I will present a more detailed 
explanation on the relationship between the DPT and cross-cultural 
pragmatics, by using examples from cross-cultural studies that can be 
explicated within the framework of the DPT. Finally, I will discuss the 
different ways in which this theory can contribute to finding solutions to 
problems created by the transfer of politeness strategies from one’s first 
language to one’s second language in actual cross-cultural exchanges and the 


| This paper is a revised version of the following paper, which is based on the plenary 
lecture of the CLS International Conference (CLaSIC) held at the National University of 
Singapore from the viewpoint of cross-cultural pragmatics. Usami, M. (2004). “Discourse 
politeness theory and second language acquisition.” Proceedings of the Inaugural CLS 
International Conference (CLaSIC). 719-737. Singapore. 


Presented by: https://jafrilibrary.com 


Presented by: https://jafrilibrary.com 


20 Mayumi USAMI 


manner in which language teachers can integrate the perspective of the DPT 
into their teaching. 


2. Basic definitions 
In the following subsections, I will first define the terms that are crucial 
for a discussion on “politeness” and “politeness theory.” 


2.1. “Politeness” 

Both Western politeness research and Japanese honorifics research have 
merely presented a vague definition of the term politeness; moreover, these 
researches do not clearly differentiate politeness from other terms such as 
deference, respect, and formality, which are occasionally used 
interchangeably. In this paper, the term “politeness” is understood in two 
contexts. In a broad context, it refers to all the different approaches and 
perspectives of the various theories of politeness. On the other hand, in a 
narrow or specific context, it refers to the politeness strategies defined by 
Brown & Levinson (1987) (hereafter, B&L), i.e., the choice of linguistic 
strategies to minimize the Face Threat of a particular act (for a review, see 
Usami, 2000b, 2002). 


2.2. “Normative politeness” and “pragmatic politeness” 

I also differentiate between the concepts of normative politeness, which 
refers to the traditional understanding of the degree of politeness intrinsic to 
“linguistic expressions,” and pragmatic politeness, which is defined as the 
“functions of language manipulation that work to maintain smooth human 
relationships” (Usami, 2001a, 2002a). In other words, pragmatic politeness 
not only comprises politeness resulting from linguistic forms and expressions 
(ie., normative) but also comprises discourse behavior, such as topic 
initiation and the appropriate use of backchannels, speech-level shifts, 
incomplete utterances, context-dependent use of particles, appropriate 
frequency of the use of particles (Usami, 1992, 1993a, 1993b, 1993c, 1994a, 
1994b, 1994c, 1995, 1996a, 1996b, 1998a, 1999c, d, 2000a, b, c, d), 
requestive speech acts, such as prefacing before making a request 
(Kashiwazaki, 1995; Xie, 2000), compliment-reply discourses (Kim, 2000), 
utterances that do not possess linguistic politeness markers (Usami & Lee, 
2003), metalanguage behavior (Sugito, 1983, 1993), and so on. Thus, in 
addition to the sentence-level politeness of linguistic forms, discourse-level 
phenomena also play an important role in pragmatic politeness. In this 
study, I primarily focus on pragmatic politeness, which can be understood as 
one of the effects of interactions on verbal behavior. 
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2.3. “Language use according to social norms” and “Strategic language use” 

In this paper, I employ the notion of language behavior that conforms 
to sociolinguistic norms and conventions to refer to all the literal, normative, 
and conventional language use that exist in the language of a society. In 
Japanese, these are not only limited to the use of honorifics, as mentioned in 
Ide (1982, 1989), but also include such behavior as the non-use of honorifics 
with close friends and the appropriate use of backchannels. Similarly, in 
English, it refers to the norms and conventions constraining linguistic 
behavior, such as the avoidance of slang in formal situations or the 
appropriate use of address terms. 

On the other hand, strategic language use refers to voluntary linguistic 
behavior based on individual choice that shows consideration toward 
positive and negative face, as defined by B&L (see 3.3), irrespective of 
honorific system in the language concerned. For example, while 
communicating in Japanese, a speaker may strategically or unconsciously 
increase the frequency of the use of backchannels in order to indicate his/her 
interest (i.e., addressing positive face) in the interlocutor. In B&L’s 
politeness theory, the term strategic language use encompasses potentially 
unconscious language use, such as an increase in the frequency of the use of 
backchannels and speech-level shifts while communicating in Japanese, and 
such behavior as code-switching and the use of joking in both English and 
Japanese. 


3. Discourse Politeness Theory 

In this section, I will introduce the six key concepts of a DPT, which has 
been developed on the basis of the results of a series of empirical studies on 
discourse behavior (for a review of previous studies on politeness theory, see 
Usami, 1993d, 1998b, 2001a, 2002a, 2002b). 


3.1. Basic concepts 

Essentially, there are six key concepts in a DPT: (1) DP default, (2) 
marked and unmarked behavior, (3) marked and unmarked politeness, (4) 
discrepancy in the estimations of the degree of the Face Threat (De value), 
(5) three types of politeness effects, and (6) relative and absolute politeness. 
Before explaining these concepts, I will first explain the term “Discourse 
Politeness.” 


3.1.1. “Discourse Politeness” 

While there have been a number of researchers who have discussed 
discourse-level factors, such as metalanguage behavior or utterance 
organization (Blum-Kulka, 1990; Kasper, 1990; Leech, 1983; Sugito, 1983; 
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Thomas, 1995 inter alia), no actual attempts have been made to integrate 
these phenomena into a politeness theory. Therefore, according to the 
results of my previous empirical studies (Usami, 1993a, 1993b, 1993c, 
1993d, 1994a, 1994b, 1994c, 1995, 1996a, 1996b, 1998a), in addition to the 
sentence-level politeness of linguistic forms, I introduce the concept of DP 
based on the opinion that discourse-level phenomena play an important role 
in pragmatic politeness. 

DP is defined as “the functional dynamic whole of factors of both 
linguistic forms and discourse-level phenomena that play a part in the 
pragmatic politeness of a discourse” (Usami, 1998b, 2001a, 2002a, 2002b, 
2003). Basically, DP can be used in two ways. Its first use is when 
referring to pragmatic politeness that can only be interpreted at the discourse 
level. However, DP is also used to refer to the “DP default” (see 3.1.2) of a 
certain discourse, which is understood to be the dynamic whole of the 
elements functioning for the pragmatic politeness of that particular 
discourse. 

The DPT involves language use that conforms to social norms and 
conventions and an individual speaker’s strategic language use as well as the 
interaction between these two. This applies to both honorific and 
non-honorific languages such as Japanese and English, respectively. I 
contend that the individual elements in DP, such as the frequency of topic 
initiation and speech-level shifts, as well as DP itself as the functional 
dynamic whole of various elements are more appropriate focal points for 
studies that compare pragmatic politeness across languages with differing 
grammatical structures. Accordingly, the examination of these topics 
would contribute to the development of a comprehensive universal theory of 
politeness. 


3.1.2. “Discourse Politeness Default” 

The notion of DP default is fundamental to the DPT and can be 
illustrated with an example from the Japanese language: 

Focusing on speech levels in Japanese, any utterance is classified as 
containing polite forms (P), non-polite forms (N), or containing no politeness 
markers that are described as non-marked utterances (NM) (e.g., incomplete 
utterances and backchannels). By calculating the frequency of the 
occurrence of each speech level within a specific discourse, it is possible to 
determine the overall ratios of the speech levels. This indicates the DP 
default for the speech levels within that discourse, and the speech level with 
the highest frequency is referred to as the “dominant speech level.” For 
example, in a study of sentence-final speech levels (Usami, 2001b), the 
average ratio of P, N, and NM was 6:1:3. This can be concluded to be 
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indicative of the DP default of the discourse in question. In this case, the P 
is the dominant or unmarked speech level (i.e., occurring in more than 50% 
of the utterances); therefore, using the N becomes marked behavior (see 
3.1.3) and gives rise to particular politeness effects, such as expressing 
empathy with the interlocutor(s), i.e., positive politeness. 

It is important to note that there exists a general DP default of the 
discourse that is unmarked in each specific discourse. Further, there are 
individual DP defaults for each individual element that contributes to DP in 
that discourse. In other words, there are two types of DP defaults: (1) the 
DP defaults of the discourse as a whole and (2) the DP defaults of individual 
elements within the discourse that constitute DP, such as speech levels and 
sentence-final particles. The DP defaults of the discourse as a whole are 
considered as “unmarked discourse”, and the DP defaults of individual 
elements such as sentence-final particles are considered as “unmarked 
discourse elements”. The average frequency of the occurrence of various 
elements, such as speech-level shifts and backchamnels, and the ratio of these 
elements relative to the structure of the discourse and their distribution 
within a particular discourse—which constitute a part of the DP of that 
discourse—are treated as one variable, i.e., as one of the DP defaults for the 
unmarked discourse elements. 

The concept of DP default as a dynamic whole is vital to the DPT. 
This is because it becomes a base parameter for calculating a relative 
politeness function, which is distinct from the politeness functions of its 
individual elements. 


3.1.3. “Marked” and “unmarked” behavior 

In the DPT, identifying the DP defaults of specific types of discourses is 
the first step in understanding the relative nature of politeness. A 
systematic investigation of relative politeness can be conducted by 
examining the movements toward and away from those DP defaults. 
Linguistic behavior that is consistent with those DP defaults is termed as 
“unmarked behavior,” while that which deviates from those defaults is 
termed as “marked behavior.” Marked behavior does not necessarily give 
rise to marked politeness (see 3.1.4) because both of these are distinct 
notions in the DPT. On the other hand, behavior consistent with DP 
defaults is always considered to be unmarked politeness. It is assumed that 
the elements comprising these DP defaults form DP as unmarked politeness, 
which is expected but unnoticed. However, if a hearer notices that 
something is either excessive or lacking with regard to these DP defaults, 
s/he might regard the speaker’s behavior as impolite (for further explanation, 
see 3.1.4). 
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3.1.4. “Marked” and “unmarked” politeness 

In B&L’s politeness theory, politeness is understood as a strategy 
whereby one redresses Face Threatening Acts (FTAs), such as requests, 
which infringe upon another person’s face. However, it has been pointed 
out that in this approach, one cannot adequately explain politeness that arises 
in ordinary conversations where FTAs do not seem to occur. 

In fact, a different type of politeness that does not involve redressing the 
Face Threats can be found in an ordinary conversation. This type of 
politeness is associated with expected behavior, which is only noticed if it 
does not occur and generates perceptions of impoliteness. In the DPT, this 
is termed unmarked politeness. This type of politeness is contrasted with 
marked politeness, which encompasses B&L’s notion of politeness as 
linguistic strategies for redressing the Face Threats. 

In the DPT, unmarked politeness refers to both the state of the discourse 
as a whole and the language behavior that is unconsciously expected. 
When those linguistic behaviors do not occur as expected, the discourse or 
utterance is considered impolite. When a speaker behaves according to 
implicit expectations or the DP default in a given situation, s/he displays 
unmarked behavior, which constitutes unmarked politeness. In contrast, 
unlike unmarked behavior, marked behavior that deviates from the expected 
norm or DP default does not necessarily give rise to marked politeness. 
Marked and unmarked politeness can be distinguished in terms of the ways 
in which they are recognized. 

B&L’s politeness theory is considered to be a theory of marked 
politeness because it primarily focuses on linguistic politeness strategies that 
can be used to redress the Face Threats in situations where one cannot help 
but commit an FTA. 

In general, politeness theory should systematically address both marked 
and unmarked politeness within a single framework, rather than merely focus 
on marked politeness, as is the case in B&L’s politeness theory (see Usami, 
2001a, 2002a, 2002b). 

Although Fraser (1990) mentioned this type of unmarked politeness in 
his framework of “conversational contract” and Watts (1992) discussed the 
same type of behavior in a wider context of “politic behavior,” it can be said 
that neither of them fully developed a comprehensive theory of DP. The 
DPT attempts to systematically address both marked and unmarked 
politeness and consider both the speaker’s and the hearer’s points of view 
within a single framework. 


3.1.5. “Discrepancy in estimation value” 
In the DPT, the “politeness strategy” is determined based on the 
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speaker’s estimation of the degree of the Face Threat of his/her act, and the 
actual “politeness effect’ is determined by the “discrepancy between the 
speaker’s and hearer’s estimations of the degree of the Face Threat of the 
speaker’s act” from the hearer’s point of view. I term the latter “politeness 
effect” and distinguish it from “politeness strategy” in B&L’s politeness 
theory. 

The De value as an index of the actual “politeness effect” is calculated 
by comparing the speaker’s and hearer’s estimations of the degree of the 
Face Threat of the speaker’s act. The “De value” is the value assigned to 
this discrepancy between the speaker’s and hearer’s estimations of the degree 
of the Face Threat. A De value cannot be an absolute numerical value, but 
rather is represented symbolically as distributed along a scale from —1 to 1. 
This is illustrated in Figure 1 below. 


-1 -0 0 +a +1 


(De value scale) $$$ 


Degree of discrepancy 
in the estimations of 


-1 S De < 0-a |0-a 


IA 


De = O0+a|O0+ta < De S$ +1 
degree of the Face . - ” 


Threat (De value) 
Appropriateness of | Insufficient behavior | Appropriate behavior Excessive behavior 
behavior (impolite) (polite) (polite insolence) 


Plus effect 
Politeness effects Minus effects eat Minus effects 
Neutral effects 


Discrepancy in estimations : De = Se —- He 


De: the degree of discrepancy between the speaker’s and hearer’s estimations of the 
degree of the Face Threat of the speaker’s act. 

Se: the speaker’s estimation of the degree of the Face Threat of his/ her act; expressed as 
a value between 0 and 1 

He: the hearer’s estimation of the degree of the Face Threat of the speaker’s act; 
expressed as a value between 0 and 1 

a: Acceptable difference between the speaker’s and hearer’s estimations of the degree 
of the Face Threat of the speaker’s act from the viewpoint of the hearer. 


Figure 1. Discrepancy in estimations (De value), appropriateness of behavior, and 
politeness effects 
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For example, minus-politeness effects include so-called “polite 
insolence” (inginburei); here, the hearer experiences unpleasant feelings 
despite the speaker’s use of polite forms. In the DPT, from the hearer’s 
perspective, polite insolence—which has thus far received little attention in 
honorifics research in Japanese—can be regarded as the result of a speaker’s 
excessive use of polite expressions that extend beyond the acceptable 
variation (+a) defined in the “De value scale” illustrated in Figure 1. In 
other words, the linguistic expressions employed by the speaker in a 
particular situation are excessively polite and go beyond the acceptable 
difference as compared with the hearer’s expectation regarding the 
appropriate linguistic behavior in that situation. 


3.1.6. The three types of politeness effects 

In the DPT, face redressing acts are considered to be a type of marked 
behavior. Three types of effects can arise from marked behavior. (1) plus- 
politeness effects, (2) neutral-politeness effects, and (3) minus-politeness 
effects. These effects essentially result in pleasantness, neutral effects 
(neither pleasant nor unpleasant), or unpleasantness, respectively. The 
neutral-politeness effects at the discourse level are not addressed in B&L’s 
politeness theory, since a notion of unmarked politeness would be necessary 
in order to examine these effects. Moreover, B&L’s politeness theory does 
not systematically treat the minus-politeness effects, which are produced by 
either making no effort to reduce the degree of threat to the hearer’s face or 
using excessive polite forms. 

Thus, the DPT expanded B&L’s politeness theory in scope. This is 
because, in addition to plus-politeness effects, it encompasses the neutral- 
and minus-politeness effects within a unified theoretical framework. 
Moreover, the neutral- and minus-politeness effects are systematically 
explained by introducing the concept of the degree of discrepancy between 
the speaker’s and hearer’s estimations of the degree of the Face Threat of the 
act in question. The discrepancy between these estimations, i.e., the De 
value is represented by a symbolic numerical continuum, as explained above. 
Thus, the minus-politeness effects (or unpleasantness), including both polite 
insolence and rudeness, can be explained by an integrated theory of 
politeness, namely the DPT. 


3.1.7. “Absolute” and “relative” politeness 

A final distinction to be made with regard to the DPT is between 
absolute and relative politeness. The former involves labeling particular 
linguistic forms or strategies as being intrinsically more polite than others, 
for example, the Japanese honorific verb irassharu (“go-Hon”) is considered 
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to be inherently more polite than its non-honorific equivalent iku (“go”). 
However, if one uses honorifics while conversing with someone with whom 
one usually speaks to rather casually (the DP default of that discourse is 
casual speech), it could be implied as sarcasm rather than politeness. 
Similarly, even if one uses non-polite expressions in situations where the DP 
default is polite forms, depending on the context, the effect could be an 
increase in the feeling of solidarity rather than an implication of impoliteness. 
Thus, in the DPT, politeness effects are considered not to be produced by 
merely using polite expressions in an absolute sense, but rather to be 
relatively produced by the “movement” toward and away from the DP 
defaults of the discourse in question. I term this type of politeness as 
“relative politeness.” 

It is important that the DPT includes both the concepts of “De value” 
(discrepancy between the speaker’s and hearer’s estimations of the degree of 
the Face Threat) and “DP default” as a base parameter for calculating 
“relative politeness” effects. Thus, the DPT integrates the interactive and 
relative aspects of politeness effects by including the abovementioned 
concept of “relative politeness.” It is important to include these three 
concepts within a theory of politeness; thus, these three constitute the 
fundamental aspects of the DPT. 


3.2. Politeness effects arising from deviated behavior from the DP default: 

Examples from Japanese and English 

DP has been defined as “the functional dynamic whole of factors of 
both linguistic forms and discourse-level phenomena that give rise to 
pragmatic politeness of a discourse” (lit. “The dynamic whole of functions 
of various elements that exist in both linguistic forms and discourse-level 
phenomena that play a part in the pragmatic politeness of a discourse’) 
(Usami, 1998c, 2001a, 2002a, 2002b). 


3.2.1. Speech-level shifts in Japanese conversations 

Japanese conversations have numerous elements that constitute DP; 
however, for the sake of brevity, I will only focus on speech levels as an 
example of an element of DP. I will explain the relative nature of the 
politeness effects through Figure 2. 

In Figure 2, the largest circle represents a set of functions of various 
elements in DP as a whole. The small circles inside the larger ones 
represent a set of functions of each element in DP, which is hypothesized to 
be factors such as the frequency of backchannels, topic introduction, and 
speech levels. The number of elements is not limited to five as shown in 
the large circles above. The circles from which the arrows are pointing 
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outward represent the unmarked speech levels as the DP defaults of the 
respective discourses. 

In the example in Figure 2, the speech level that deviates from the 
unmarked dominant speech level as the DP default, becomes “marked 
behavior” at the utterance level (N in conversations between people meeting 
for the first time, and P in conversation between friends). 


DP defaults DP defaults 
of conversations between people meeting of conversations between friends 
for the first time or married couples 


(Unmarked speech level = P: polite forms) (Unmarked speech level = N: non-polite forms) 


N P 
Deviation from the unmarked speech level 


(marked behavior) 


Marked behavior gives rise to particular functions 
(“plus-politeness effects,” “neutral-politeness effects,” or “minus-politeness effects”) 


Figure 2._ DP defaults and marked behavior in specific activity-types 


As illustrated by the diagram on the left-hand side of the figure, in the 
case of a conversation between unacquainted people, P (polite forms) is the 
dominant, unmarked speech level as the DP default of the discourse. 
Therefore, the continued use of P maintains unmarked politeness, while the 
use of N (non-polite forms) becomes marked, giving rise to certain special 
functions or effects, such as showing empathy and indicating a topic change 
(Usami, 1995). 

On the other hand, as illustrated on the right-hand side of the figure, in 
the case of a conversation between friends or married couples, the DP default 
or unmarked speech level is contrary to the previous case, ie., N. 
Accordingly, in this discourse, the use of N constitutes the DP default as 
unmarked politeness and can be considered as sufficiently polite. Thus, the 
use of P in this discourse becomes marked behavior, and contrary to the view 
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of politeness as the “politeness level of linguistic forms,” a failure to 
conform to the DP default by the use of a P may result in the 
minus-politeness effect, ie, sarcasm or impoliteness. This might be 
understood intuitively in terms of everyday observations. 

According to the DPT, marked behavior gives rise to one of the 
following three types of politeness effects: (1) plus-politeness effects (e.g., 
expressions of familiarity or closeness), (2) neutral-politeness effects (e.g., 
changing topic), or (3) minus-politeness effects (e.g., sarcasm or 
impoliteness). 

Accordingly, we notice that P can be used either when arguing or to 
express sarcasm in conversations where N is the unmarked speech level of 
the discourse. In other words, in conversations where N is unmarked, the 
use of P, which constitutes marked behavior, can also give rise to the three 
effects mentioned above including minus-politeness effect, despite the fact 
that P itself is a ‘polite form’. 

These examples indicate that, essentially, it is the “dynamics” of 
language use—in a specific situation where the speaker deviates from and 
returns to the DP defaults as unmarked politeness — and not the absolute 
politeness level of the linguistic form that is responsible for occurrence of 
pragmatic politeness effects (Usami, 2001a, 2002a, 2002b). 


3.2.2. The switching of the absolute politeness level of linguistic expressions 
in conversations between English speaking couples 
The same phenomenon can be observed in English and can also be 
explained by using the notion of DP default given in the DPT; _ this 
indicates that it is possible to interpret pragmatic politeness in different 
languages within the same framework. 
The following example is taken from Thomas (1995: 156). 


(1) [Taken froma short story by James Thurber?’ 
A married couple is trying to decide on a restaurant. The husband says: 
“You choose.” 


Thomas maintains that although this utterance is a direct imperative, it 
would normally be seen as perfectly polite because the speech act is what 
Leech (1983: 107-108) terms as “costly to the speaker” or (better in this 
case) “beneficial to the hearer” (Thomas, 1995: 156). This explanation is 
applicable to English; however, it cannot be used to explain a Japanese 


* James Thurber (1963). A couple of hamburgers. Vintage Thurber (p. 103). London: 
Hamilton. 
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translation of this example (“Kimi, erabe yo”). Irrespective of the 
utterance’s “benefit to the hearer,” if the hearer is of a higher social status 
than the speaker or the hearer is someone the speaker has met for the first 
time, it is impossible for such an utterance to be understood as polite. The 
alternative translation “Erande kudasai” has the minimum appropriate 
amount of politeness; however, as compared with the original, it has a higher 
“politeness level” in terms of linguistic form. Further, it is not a direct 
imperative; it is the form of a request. Therefore, Thomas’s argument does 
not hold for this example in Japanese, and we can conclude that one cannot 
translate the politeness effects in English directly into Japanese. What this 
example does show is that due to the strong influence of the various 
structures and characteristics of different languages, it is impossible to 
present a consistent explanation of politeness across languages at the 
utterance level. 

Thomas (1995) further discusses her claim that there is no necessary 
connection between the politeness level of linguistic forms and the politeness 
(effect) arising from those forms with the following example (p. 156) in 
which a married couple is becoming irritated with each another: 


(2) “Will you be kind enough to tell me what time it is?” 
[and later]: 
“If you’ll be kind enough to speed up a little.” 


Thomas explains that in the context of an intimate relationship, these 
utterances “appear inappropriately indirect” (p. 156). However, while 
Leech’s politeness maxims can explain why the utterance in example (1) is 
regarded as polite, they cannot explain why the utterances in example (2) 
cannot be regarded as polite, despite the use of indirect expressions. 
Thomas comments that only “...in the context of an intimate relationship 
they appear inappropriately indirect...” (p. 156); however, she does not 
explain why one would interpret it in this way. 

In the DPT, both examples (1) and (2) can be interpreted by adopting a 
common principle. Direct expressions can be assumed to be the unmarked 
politeness or DP default between English speaking couples. Therefore, the 
direct “You choose” in example (1) constitutes unmarked politeness and is 
thus sufficiently polite (or perhaps it would be more appropriate to suggest 
that it is not rude/impolite). On the other hand, in example (2), since direct 
expressions are the DP default, the wife’s use of so-called “polite 
expressions” became a marked behavior and produced a minus-politeness 
effect (i.e., sarcasm or impoliteness). 
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3.3. Determining DP defaults for research in cross-cultural pragmatics 

According to the DPT, after identifying the DP default as unmarked 
politeness for each individual conversation or discourse, one can examine the 
politeness effects arising from the marked behaviors in that discourse 
interaction. Accordingly, one can interpret the pragmatic politeness effects 
arising in a specific conversation in a relative manner, even if there are 
variations in the discourse content, use of linguistic forms, and relationships 
between the interlocutors in the absolute sense. 

The determination of the DP default of a discourse—which is an 
amalgamation of various elements—includes the determination of the DP 
default of the overall discourse and individual elements, such as speech level 
default, backchannel frequency default, default for the frequency of topic 
initiation by each interlocutor, default for the request sequence pattern, and 
so on. Thus, it is necessary to identify the elements that give rise to 
important functions in the DP of representative activity-types in various 
languages and cultures. It is then necessary to identify the DP defaults of 
both the discourse as a whole and the crucial elements within the discourse. 

Identifying the DP defaults of each element in particular activity-types 
in different languages and cultures is a relatively simple starting point and is 
a topic of significant interest. However, strict adherence to the process of 
determining the DP default of the overall discourse and each activity-type 
within the discourse and then identifying the individual marked behaviors 
that deviate from these DP defaults using the newly collected data requires 
considerable time and effort. Thus, in order to simplify this process, one 
can use the approximate tendencies that emerged in similar activity-types 
from the results of previous studies as the DP defaults for a specific 
discourse or activity-type. For example, with regard to the discourse 
between married couples, there exists considerable data proving that “direct 
expressions” are unmarked in English. In this way, the abovementioned 
utterance (Will you be kind enough to tell what time it is?) in the 
conversation between the married couple can be interpreted to be an example 
of sarcasm rather than politeness. 


3.4, The DPT and associated research 

A number of empirical researches have pursued various interests in 
pragmatics from the perspective of DP. These studies have identified the 
DP defaults of crucial elements in the various activity-types of a discourse as 
the basis of the research. For example, the DP defaults for the frequency of 
topic initiation, distributions of speech levels, and frequency of the 
speech-level shift have been identified for conversations between Japanese 
adults meeting for the first time (Usami,1996a, b, 1998c, 2001a, 2002a). 
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Otsuka (2004) has analyzed the effects of changes in speech style in TV 
debates utilizing the DPT framework. 

Case study findings have also identified tentative DP defaults for the 
use of backchannels at the beginning and the end of conversations between 
Japanese adults meeting for the first time (Usami, 1993a, b, c, d, 1994a, 
1995). DP defaults for the frequency of the use of the particle ne in casual 
conversation between colleagues and in meetings have also been examined 
(Usami, 1997). Kiyama (2005) has identified that there are different DP 
defaults for “substantive disagreement” and “courtesy disagreement” in 
Japanese conversations between friends and strangers. Xie (2000) has 
identified the DP defaults in Chinese and Japanese for sequence patterns in 
request discourses. Olivieri (1999) has identified the DP defaults of speech 
levels in conversations between the Japanese and those between the Japanese 
and Italians meeting for the first time; Usami and Lee (2003) has identified 
the DP defaults of speech levels and their distribution in conversations 
between the Japanese and those between the Koreans. In addition, although 
not planned specifically from the perspective of DP, the ways in which the 
DP defaults for the Japanese with regard to the conversations between newly 
acquainted Japanese and Koreans changed over time were also investigated 
on four different occasions over a set period of time (Oyanagi, 2000). 


4. The application of the DPT to cross-cultural pragmatics and second 
language education 

The DPT is intended to be a universal theory of politeness. However, 
at the same time, the theory integrates the factors that are related to cultural 
norms and customs in speech acts and discourse behaviors by introducing the 
concept of “relative politeness,” which is based on the “DP default” as 
unmarked politeness. Therefore, the first step in applying the DPT is to 
identify the main elements/factors that constitute DP defaults and conduct 
comparative studies on the defaults of these elements with regard to DP in 
various languages and cultures. 

The important constituents for DP may be different in each language 
and culture; moreover, even if the same constituents are important, the DP 
defaults of those constituents may differ for each language and culture. In 
this sense, although investigating the DP defaults for each main activity-type 
in various languages and cultures seems to be an investigation of the 
typology of linguistic behavior patterns, from the perspective of the DPT it 
also means identifying the DP default as unmarked politeness in various 
languages and cultures in order to analyze the effects of marked behavior. 

For example, if the differences in request or refusal behaviors among 
various cultures are examined from the perspective of the DPT, the different 
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“DP. defaults” for request or refusal discourses in each culture can be 
regarded as the basis for identifying “marked behaviors,” which basically 
produce politeness effects in each language and culture. Based on this, one 
can regard such studies as those that do not simply describe cultural 
differences from a cross-cultural perspective, but rather identify the DP 
defaults of specific verbal behaviors in various languages and cultures from 
the perspective of the DPT. In this case, one can utilize the DP defaults as 
the basis for the different perceptions and impressions of specific verbal 
behaviors in each culture. Thus, the DPT can be applied to find ways to 
solve intercultural miscommunication. 

Furthermore, these findings allow a universal explanation of the 
motivations and mechanisms that give rise to politeness strategies and the 
effects that underlie the identified culture-specific behaviors in various 
languages and cultures. 

For example, Xie (2000) examined request discourse in Japanese and 
Chinese based on the data from the discourse completion tests (DCT) and the 
recordings of actual conversations. In the case of the former, an analysis of 
the utterance-level responses showed no differences between requests in 
Chinese and Japanese. However, in the case of the latter, she found that in 
Japanese it was common for there to be a sequence of (1) attention-getter, (2) 
checking the possibility of compliance, (3) supportive strategies (explanation 
for the request), etc., before the appearance of the request utterance itself. 
However, in Chinese, the request utterance followed immediately after the 
attention-getter; thus, suggesting a difference in sequence patterns between 
the two languages. 

If we interpret these results in terms of DP, we can observe that the DP 
default as unmarked politeness with regard to the request discourse is 
different in Japanese and Chinese. In Japanese, going through the elaborate 
sequence before making a request constitutes unmarked politeness; however, 
in Chinese, the short sequence, i.e., an attention-getter, followed by a request 
utterance constitutes unmarked politeness. Based on these results, if 
learners of Japanese who are native speakers of Chinese transfer the 
utterance sequence that constitutes unmarked politeness in Chinese to 
Japanese, their Japanese interlocutors may feel that their requests are either 
abrupt or rude. This might be the case even if the “politeness level of the 
linguistic forms” in their requests is appropriate at the utterance level. 

On the other hand, if one follows the Japanese request sequence pattern 
to make a request in Chinese, this marked behavior may be viewed as being 
cold and distant or as harboring some ulterior motive. Thus, if you consider 
the differences in the DP defaults across various languages and cultures, the 
identification of the DP defaults going beyond the utterance level for specific 
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activity-types can be useful in facilitating smoother intercultural 
communication and clarifying the causes of intercultural miscommunication 
with regard to politeness. 

Similarly, by analyzing the differences in the DP defaults of important 
speech acts in various languages and cultures, one can focus on the 
interactions between native and non-native speakers. This leads to a richer 
understanding of the reasons for intercultural miscommunication at the 
discourse level, moving beyond a focus on grammatical errors and the use of 
honorifics at the sentence/utterance level. Ultimately, this understanding 
may be applied to facilitate smoother intercultural communication. 

The DPT and second language acquisition (language learning) are 
closely related. This is because in terms of cross-cultural interaction, 
language learners have to identify the DP defaults and learn them in order to 
achieve smooth communication with others in the target language and 
culture. Some studies have already begun to analyze natural conversation 
data and conversation teaching materials in order to compile language 
teaching materials with the view to utilize the framework of the DPT (Usami, 
2005; Suzuki et al., 2005; Xie et al., 2003). 


5. Future issues with regard to the DPT 

Future work on the DPT will focus both on validating its assumptions 
through empirical research and further development of the theory itself. 
Validating the DPT involves identifying the DP defaults as unmarked 
politeness for various “activity-types” of discourse in different languages. 
The DP defaults for a particular type of discourse are identified by 
examining the typical examples of that discourse. Thus, identifying these 
DP defaults is somewhat similar to clarifying the sociolinguistic norms and 
customs in language use at the discourse level. 

However, the aim of this research is not to establish model examples of 
discourse, as such, but rather to focus on the deviations from the DP defaults 
in order to develop a better understanding of the relative politeness 
phenomena associated with these DP defaults. Further, the theoretical 
development of the DPT will focus on predicting, interpreting, and 
explaining how politeness functions in human interactions. It is necessary 
to further clarify how the content of the utterances and the speaker’s 
intentions are related to their various effects, such as expressing empathy 
with others, picking fights, or simply emphasizing the prepositional content 
of the utterances. The issues that need to be further studied can be 
summarized under the following four main themes: 


(1) Systematizing the relationship of the interactional politeness effects between the 
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utterance content and the politeness level of its linguistic forms. 

(2) Systematizing the process of identifying and predicting the politeness effects 
(plus- , neutral- , and minus-politeness effects) arising from a marked behavior. 

(3) Systematizing the politeness effects associated with the utterance sequences. 

(4) Theorizing about the speaker’s “intentionality” of committing (or not 
committing) FTAs. 


6. The DPT as a theory of interpersonal communication 

The ultimate aim of the DPT is to establish a universal theory to 
investigate and compare politeness effects in languages with and without 
honorifics, such as Japanese and English. The DPT has a number of 
innovative aspects including expanding the scope of research beyond that 
encompassed by B&L’s politeness theory to the discourse level and defining 
the term “politeness” operationally as a relative phenomenon involving the 
interaction from both the speaker’s and hearer’s perspectives. The DPT 
also introduces the notion of “relative politeness” by incorporating the notion 
of the DP defaults of particular activity-types of discourses as unmarked 
politeness; this enables the DP default to serve as the basis for an analysis 
through which deviations as marked behavior become apparent, thereby 
generating actual politeness effects. 

The DPT differs from B&L’s politeness theory, which emphasizes the 
speaker’s estimation of the degree of the Face Threat. In other words, when 
considering the actual politeness effects, the DPT incorporates not only the 
use of the speaker’s politeness strategies based on his/her estimation of the 
degree of the Face Threat of the act in question but also the discrepancy 
between the speaker’s and hearer’s estimations of the degree of the Face 
Threat of the speaker’s act. Hence, as shown in Figure 1, when the 
discrepancy between the speaker’s and hearer’s estimations of the degree of 
the Face Threat of the speaker’s act is approximately zero or within 
acceptable variations (0 + a), it is regarded as appropriate behavior, 
regardless of the politeness level of the linguistic forms themselves. In 
other words, it is assumed that the actual politeness effects are assumed to 
arise from the discrepancy between the speaker’s and hearer’s estimations of 
the degree of the Face Threat of the speaker’s act’. Furthermore, in the DPT, 
the “differences in their perceptions of the prototypical patterns or schemata 
of specific activity-types” are also regarded as the “DP defaults” of those 
activity-types and assume an important role in the overall theoretical focus. 


3 oe ‘ F ee 
It is assumed that the speaker acts on the basis of his/her own estimation of the degree of 
the Face Threat. Cases in which the speaker intentionally threatens the face of the 
hearer through linguistic behavior are treated separately. 
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The above discussion implies that the hearer’s perspective and the 
discourse-based perceptions of both the speakers and hearers are given more 
weight in the explanation of pragmatic politeness. Thus, this approach 
incorporates the discourse-based relative perception of human interaction as 
a key aspect and is the first to systemize politeness at the discourse level. 
This is the primary reason for terming this framework the DPT. 

In the DPT, politeness is a general term encompassing not only absolute 
politeness, or the speaker’s politeness strategies, but also the relative 
politeness effects arising through deviated behaviors from the DP defaults of 
various activity-types of discourses. The aim of the DPT is to investigate 
the universality of the mechanisms underlying these types of discourse 
behaviors as well as culture-specific politeness strategies that arise out of the 
universal motivations for smooth human relations and interpersonal 
communication. In this sense, this theory can be regarded as a theory of 
interpersonal communication. 


7. Conclusion 

In this paper, I have discussed the key aspects of the DPT by focusing 
on the concept of relative politeness and the interactive and dynamic nature 
of politeness strategies and politeness effects. Several other issues are 
being examined within the framework of the DPT and its connection to 
second language acquisition. Further, there remain a number of unresolved 
issues that have been mentioned in this paper. These issues, although 
related to the DPT, are also important aspects for future research in the fields 
of cross-cultural pragmatics, interpersonal communication, and second 
language acquisition. In other words, all these issues are related to our 
approach to systematize the functions of interactivity, dynamics, and 
relativity in interpersonal communication. The further development of this 
theory will not only lead to a more comprehensive theory of politeness but 
may also contribute to the further development of theories of cross-cultural 
pragmatics and interpersonal communication as well as the application of 
this theory to second language acquisition. 
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Teaching Conversational Storytelling Skills 
to Japanese Students of English — Why is it 
Necessary and What could be Taught?’ — 


Takashi SUZUKI 


1. Introduction 

English conversation classes in Japan have tended to focus on teaching 
the skills and aspects of language that are necessary for explicitly identifiable 
goals or purposes such as specific speech acts, tasks/situations, or 
professional/academic needs. However, in our daily lives, we often talk for 
the sake of talking without having such specific goals. In recent years, such 
“casual conversation” has been found to serve the crucial social functions of 
building and maintaining interpersonal relations (Coates, 1997; Coupland, 
2003; Eggins & Slade, 1997; Holmes & Stubbe, 2003). With the world’s 
increasing globalization and Japan’s burgeoning multiculturalism, the social 
functions of conversation deserve greater attention in the field of English 
education in this country. 

One discourse genre that has been found to appear commonly in casual 
conversation in English is “conversational stories” or “(conversational) 
narratives.” Research has found that there is a wide range of knowledge and 
skills that native speakers of English utilize in order to narrate conversational 
stories successfully. In this paper, I will focus on this discourse genre in 
casual conversation and discuss the reasons for and the benefits of 
incorporating the knowledge and skills required for such storytelling into 
Japanese EFL classes. Following the definition of a conversational story (CS, 
hereafter), I will discuss the importance of CSs from the social and 
pedagogical perspectives. Some aspects of CSs that could possibly be 
incorporated into English conversation classes will then be suggested. Finally, 
I will provide an analysis of three story-like chunks of talk produced by a 
Japanese EFL learner in order to illustrate the abovementioned points and 
further explore the implications of incorporating storytelling into EFL 
classes. 


' An earlier version of this paper was submitted to the Department of Linguistics, 
Macquarie University, in partial fulfillment for the degree of Doctor of Applied 
Linguistics. Conversational data is reused in this paper with the renewed consent of all 
the participants. 
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2. Importance of conversational stories for language learning 

In this paper, a “conversational story” is defined as “a description of the 
speaker’s or others’ past experience, which has a ‘point’ and is realized with 
a longer turn at talk,” although there are many boundary cases in naturally 
occurring conversations.” 

The ability to narrate conversational stories successfully is important for 
learners as well as native speakers of English because of their interpersonal 
functions. CSs are found to serve functions such as expressing solidarity 
(Johnstone, 1993), achieving intimacy (Coupland & Jaworski, 2003), 
assigning responsibility (Mandelbaum, 1993), and prescribing behavior 
(Eggins & Slade, 1997). Such functions of CSs correspond to the 
“interpersonal motivation” of “affiliation” and “control” (Macintyre, 
Clément, Doérnyei, & Noels, 1998) and will help learners to build and 
maintain social relationships. 

However, the ability required for a successful narration of CS is quite 
complex and may not be easily acquired by many non-native speakers 
(McCarthy, 1995:200), particularly those in an EFL context where there is 
little exposure to CSs. In Bachman & Palmer’s (1996) framework, this 
ability appears to include textual knowledge (e.g., the structure of a story), 
functional knowledge (e.g., manipulative functions), and sociolinguistic 
knowledge (e.g., appropriateness) in addition to syntactic, lexical, and 
phonological knowledge. Further, all this knowledge must be accessed and 
applied in “real-time” as the story progresses. 

Considering the importance of the functions realized with CSs and the 
difficulties involved in their production, it appears evident that the 
knowledge and skills of storytelling should be included in English curricula. 
However, in Japan’s (and possibly other EFL countries’) English classrooms, 
conversational storytelling has not been taught or practiced extensively.’ 

Moreover, from a pedagogical perspective, practicing conversational 
storytelling per se is expected to have positive effects for language learning. 
Since the relationship-building function of storytelling can be performed 
authentically in a classroom setting (unlike, for example, “complaining” or 
“shopping”), storytelling activities will offer an opportunity for genuinely 


See Tannen (1984: 97), Ervin-Tripp & Kuntay (1997: 135), or Stein (1982) for 
discussions on the definition of this genre. 

One possible reason for this is a common perception that casual conversation is “trivial” 
and “nothing happens” in its course (Eggins & Slade 1997: 17), and therefore, it is not 
worth teaching. Another possible reason is the egalitarian and cooperative orientation in 
Japanese education, which may have led teachers to favor “dialogic” conversation in 
which participants take frequent and equally distributed turns at talk. An exception to this 
tendency is a formal/institutional genre of talk such as debates or presentations. 
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meaningful communication, which is generally considered crucial for 
effective language learning (Rivers, 1980, cited in Schmidt, 1981). Further, 
the desire to narrate CSs appears to be fairly universal, and therefore, it is 
likely that storytelling increases the willingness of many learners’ to 
communicate in English, which will also contribute to effective learning 
(Macintyre et al., 1998).* 

In sum, conversational storytelling is important for learners not only 
due to its interpersonal functions but also its potential as a vehicle for 
language learning. Learners also need to practice such storytelling due to the 
difficulties involved in it. Thus, there are benefits in as well as a requirement 
for incorporating conversational storytelling into EFL classes. 


3. What could possibly be presented to learners? 

In order to incorporate conversational storytelling skills into 
conversation classes, we first need to recognize what is already known about 
such stories and can therefore be applicable to language teaching. Although 
various aspects of CSs have been explored, in this paper, I summarize three 
points that are likely to benefit learners in producing successful CSs. They 
concern (1) the functional/structural sub-categorization of CSs, (2) 
evaluation in CSs, and (3) the topical relevance of CSs. 

Following Plum (1988, cited in Eggins & Slade, 1997: 236), Eggins & 
Slade (1997) divided stories narrated in casual conversation into four 
sub-categories based on their structure and function: narratives, anecdotes, 
exemplums, and recounts. A narrative, for example, has a canonical pattern 
of “(Abstract) — (Orientation) — Complication — Evaluation — Resolution — 
(Coda),” while a recount has a simpler pattern of “(Abstract) — Orientation — 
Record of events — (Coda)” (Eggins & Slade, 1997: 268). Narratives and 
anecdotes often have the functions to entertain or interest the audience and 
provide an opportunity to “share a reaction” (1997: 243). An exemplum is 
used to present a “moral point” (1997: 257), while the purpose of a recount is 
simply “to tell how one event leads to another” (1997: 259). A similar genre 
— although not categorized here as a CS — is a “report” (Polanyi, 1982), in 
which the point (i.e., why the events told are considered worth reporting) is 
left implicit.” Polanyi (1982: 515) provides an example of a report 
delivered by a child when he was asked by his mother about his day at 


Here, I refer to learners’ “desire to communicate with a specific person” (Macintyre et al., 
1998: 548) within the activity of storytelling at a specific time/place, rather than 
“willingness to communicate” in their framework as a more general personality trait. 

The terms used for the sub-categories of stories/narratives differ among some scholars. 
For example, Toolan’s (2001: 195) definition of a “recount,” appears to correspond to 
Polanyi’s (1982) definition of a “report.” 
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school. In this paper, a report, in Polanyi’s sense of the term, is not 
considered as a CS because according to our definition, a CS must have a 
point. Since a mismatch between function and form may cause “pragmatic 
failure” (Thomas, 1983), the knowledge of these sub-categories of CSs and 
similar genres is likely to benefit EFL learners in successful storytelling. 

“Evaluation” refers to “the means used by the narrator to indicate the 
point of the narrative, ... ” (Labov, 1972: 366)° and includes means such as 
an explicit evaluation of the event (e.g., “the most embarrassing moment”), a 
description of an action that illustrates the point of the story (e.g., “they 
started laughing”), and the use of intensifiers (e.g., gestures, intonation, 
repetitions), among others. Although the exact range of evaluative devices is 
not entirely clear, it is generally recognized that storytellers do use such 
devices, and the failure to do so well can result in the audience not “getting 
the point”. 

Stories told in naturally occurring English conversation are found to 
occur often in “rounds”, in which “speakers exchange stories of personal 
experiences that illustrate similar points” (Tannen, 1984: 100), or, to be at 
least topically related to another speaker’s story (Cheshire, 2000: 235). 
Learners need to be aware that CSs are often thematically interrelated in 
some way and do not occur as a random succession of totally unrelated tales. 

Thus far, I have provided a rationale for teaching the knowledge and 
skills of conversational storytelling and have introduced three aspects of CSs 
that can possibly be applied to language learning/teaching. In the remainder 
of this paper, I will analyze three story-like chunks of talk produced by an 
EFL learner and will attempt to illustrate the need for and the benefits of 
teaching and practicing conversational storytelling in EFL classrooms. 


4. Analysis of a report and CSs produced by a Japanese EFL learner 
In this section, I will analyze three story-like chunks of talk, one report 
and two CSs, that were produced by a Japanese learner of English. 


4.1. Data collection 

The report and the CSs were collected in the following manner. Four 
(initially, three) learners who were classmates in an English program at 
college participated in the simulated conversations. All the story-like chunks 
of discourse produced by one of the participants (1 report and 2 CSs, with a 
total time of 8 minutes and 20 seconds) were then transcribed. The report 
occurred in a “free” conversation with minimum instructions. The 


2 Although Labov (1972) only discusses evaluation in narratives, evaluation has been 
observed in the other types of CSs as well. 
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participants were asked to have a chat imagining that they were waiting for 
the teacher to come to class. The two CSs were recorded after a 15-minute 
“mock lesson” on CSs. In this lesson, by partly using the participants’ 
introspection on casual conversation in their L1 (Japanese), they were 
introduced to the three aspects of English CSs that were discussed earlier, 
namely, the function and structure of CSs, evaluation in CSs, and their 
topical relevance. They were then asked to attempt narrating such stories 
(those with a point, evaluated, and topically relevant) if they could, while 
chatting in the same imaginary situation as before. 


4.2. Analysis 

In this section, I will provide a linguistic/interactional analysis of the 
report and the CSs produced by the learner. This will be followed by a 
discussion of the results and their implications for language 
learning/teaching in the next section. The transcripts are provided in the 
Appendix. 


Report “Spring Vacation” 

This report was the only story-like chunk of talk produced by the 
learner (“L” in the transcript) in the “free talk” stage. The report began in 
line 7 after another participant, O1, asked L about the spring vacation. From 
the functional/structural viewpoint, this was a report rather than a CS 
because the overall point of the “story” was unclear, although it was 
evaluated in some parts (line 24, “very fun”; line 36, “very great”; and line 
62, “I hate that!”). Further, the events described had little causal or thematic 
unity beyond the temporal coherence.’ 

From an interactional perspective, it can be observed that this report 
was not only triggered by O1’s question in line 7 but was also constantly 
steered by O1’s subsequent questions (in lines 12, 16, 37/39, 42, 46, 53, and 
64). This was particularly evident in lines 37/39 and 64 where O1 shifted the 
course of L’s report. Thus, the entire report seemed to be led more by O1 
than by L, who followed O1’s lead rather passively. Throughout this report, 
tum taking was based on a question-answer sequence, which is a typical 
turm-taking pattern in language classes. 

Apparently, this report was delivered successfully without much 
miscommunication (except for a repair in lines 43-44); however, the 
response from the listeners was relatively limited with laughs/chuckles 


” This report could be divided into three shorter segments beginning from lines 7, 21, and 
65, respectively; however, the points of the segments (particularly, the first two segments) 
will not be clear; they will still be categorized as “reports” rather than as CSs. 
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occurring only once among them (line 63). 


CS1 “The Needle” 

This CS, which began in line 12 (after a failed effort in line 6), was the 
first of the two stories narrated after the brief mock lesson on CSs. This story 
can be categorized as a narrative as it was about a problematic, possibly 
dangerous situation (i.e., a “complication”) and was apparently told to raise 
interest in the audience and/or share a reaction with them. The story was 
evaluated in many ways, including a report of the storyteller’s own reaction 
(line 33), emphatic prosody (lines 33 and 43), an explanation of what might 
have happened (line 52/to use Labov’s (1972) term, “comparator”), and 
explicit evaluations (“horrible” in lines 52 and 58). 

From the interactional perspective, although the CS was prompted by 
O2 (line 5), the storyteller appeared to be in control of the story once it 
began. Most of the questions by the other participants (lines 28, 42, 44, 47/49, 
and possibly 20, 30, and 36) were requests for confirmation or clarification 
and did not affect the course of the story. 

Although there was an obvious case of miscommunication (line 41-), it 
was dealt with and eventually solved through negotiation between the 
storyteller and the listeners. In terms of the number of laughs/chuckles, this 
CS received a better response from the listeners than the report “Spring 
Vacation,” eliciting 5 laughs/chuckles from them (lines 13, 15, 35, 54, and 
62). 


CS2 “The Banana” 

This CS is probably best categorized as an anecdote in Eggins & 
Slade’s (1997) term, since it is about an interesting event that is not 
particularly problematic’ and is told to amuse the listeners (line 1, “There is 
a funny story”). Evaluation was performed explicitly (e.g., “funny” in line 1) 
and through an onomatopoetic expression (line 46). 

In terms of interaction, the storyteller appeared to be in control. The 
questions from the listeners (lines 24, 26, and 30) were merely requests for 
clarification or confirmation and did not shift the course of the CS. One of 
the listeners contributed in the form of negotiation with the storyteller in 
lines 42-44 when she failed to produce a lexical item needed for the story. 
The same listener also narrated a CS about a similar personal experience, 
thereby initiating a story round (line 57-). 


: However, the part about the approaching bicycle (line 38— Please check if there should be 
a line number after “—“ to indicate the end, or “onward”) can be considered to be the 
“complication” stage, and the whole story can be observed as a borderline case between a 
narrative and an anecdote. 
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This CS appeared to have been received enthusiastically by the listeners, 
judging from the number of simultaneously produced laughter (in more than 
ten lines, e.g., 20, 23, and 27). 


4.3. Implications for English teaching/learning 

In this section, I will first account for the absence of CSs in the free talk 
stage of the recording and then discuss some aspects of CSs observed in the 
previous sub-section that are likely to affect language learning. 

While the possibility cannot be denied that the absence of CSs in the 
free talk stage of the recording was purely accidental, it could have been a 
case of deliberate avoidance by the learner. In my experience as an English 
teacher/learner, I found that many Japanese EFL learners have a tendency to 
avoid narrating CSs because of the complexity and difficulty of the task. The 
narrator of a CS is obligated to express the point of the story through various 
means of evaluation, while the person who delivers a report is “released ... 
from the responsibility of asserting the relevance in his telling“ (Polanyi, 
1982: 515). Since it has been observed that learners tend to avoid failure by 
choosing an easy task (Oxford & Shearin, 1994: 19), it appears likely that the 
learner “L” chose not to produce a CS for this reason. 

While a report has functions that are different from those of a CS and is 
by no means “inferior” per se, learners may not acquire the skills/knowledge 
needed for the production of CSs if they continue to avoid them. As a result, 
they might produce a report (or another genre) in a situation where a CS is 
required or appropriate (e.g., a story round at a party), or produce a CS that 
resembles a report in form (i.e., a “pointless” story). Both of these cases 
constitute pragmatic failure (Thomas, 1983) and may “reflect ... badly on 
him/her as a person” (1983: 97). For these reasons, I believe that it is 
necessary for EFL teachers to encourage or “push” learners to narrate CSs 
rather than reports. 

Moreover, in light of the findings in recent studies on language learning, 
particularly motivation, some of the aspects of CSs analyzed in the previous 
sub-section appear to suggest that CSs may be a particularly effective vehicle 
for language learning. 

First, conversational storytelling is a goal-oriented activity. In a CS, the 
storyteller’s goal is to make the audience understand the point of the story. 
While trying to attain this goal, the storyteller receives constant feedback 
from the audience in forms of laughter, exclamations, and 
confirmation/clarification as we observed in CSs 1 and 2. When the story is 
narrated successfully, the storyteller usually receives positive feedback, 
which can be considered as a “reward.” According to the goal setting theory, 
all these aspects of storytelling (goal orientation, constant feedback, and 
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reward) are likely to have positive effects for language learning (Oxford & 
Shearin, 1994: 19). 

Second, because of the goal orientation of CSs, conversational 
storytelling gives rise to the need for negotiation of meaning, which is also 
claimed to promote language learning (Long, 1996). This is because in a CS, 
while the storyteller is required to clarify the point of the story, it is the 
listeners’ responsibility to understand the point and accordingly provide 
appropriate feedback both during and after the narration. Since 
miscommunication in a CS is likely to prevent listeners from providing 
appropriate feedback, it must be avoided or repaired by all means. Thus, in a 
CS, negotiation of meaning occurs frequently, as observed in the presented 
CSs. According to Long (1996), negotiation of meaning facilitates language 
learning because it “connects ... internal learner capacities, particularly 
selective attention, and output in productive ways” (1996: 452). However, 
findings in previous research suggest that the context of the language 
classroom may promote little negotiation of meaning (Foster, 1998; Lyster, 
1998, both cited in Tarone, 2000: 192). Therefore, conversational storytelling, 
which provides many opportunities for negotiation, appears to be a 
particularly important learning activity in the classroom setting. 

Finally, when narrating a CS, the storyteller tends to assume an active 
role in conversation. As observed in the above analysis, while the storyteller 
stays in control during a narration of a CS, a report can be started and steered 
by the listener(s) with the storyteller passively following his/her lead. This 
latter situation is observed commonly in EFL classes when the teacher asks a 
question such as “So, how was your weekend, Takashi?” and continues to 
ask other follow-up questions. Though in a slightly different context, 
Dickinson argues that “enhanced motivation is conditional on learners taking 
responsibility for their own learning” (Dickinson, 1995: 173-74, cited in 
Déornyei & Csizer, 1998). Thus, this “activeness” on the part of the speaker, 
which narrating a CS entails, possibly makes conversational storytelling an 
effective activity for language learning. 


5. Conclusion 

CSs serve the crucial social functions of building and maintaining 
relationships. Moreover, they are an effective tool for language learning 
since they enable meaningful communication within the context of the 
classroom. Through the analysis, it was also found that such storytelling is 
likely to enhance learning by increasing the motivation of the learner in 
multiple ways. Therefore, it seems reasonable to suggest that conversational 
storytelling be given greater importance in EFL classrooms in Japan and 
possibly in similar EFL contexts/countries. 
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Although only three aspects of CSs_ (structural/functional 
sub-categorization, evaluation, and topical relevance) are presented in the 
mock lesson discussed in this paper, research has yielded a wider range of 
findings about CSs in English. These findings concern formulaic expressions 
used in CSs (Norrick, 2000; Sacks, 1972), the appropriateness of topics in a 
particular context (Coupland & Jaworski, 2003), the frequency of CSs 
(Tannen, 1984), and listener participation including, for example, the 
co-construction of stories (Cheshire, 2000; Coates, 1997). The knowledge of 
these aspects of CSs will most likely benefit EFL learners in developing 
conversational storytelling skills, and it should be considered for 
incorporation into language curriculum in the future. 

One factor related to second language acquisition that is not discussed 
in this paper is L1 transfer. This omission is due to the fact that little is 
known about the differences (and similarities) between English and Japanese 
CSs at present. More research is required in this area in order to determine 
exactly what needs (or does not need) to be taught to Japanese learners about 
English CSs. 
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Appendix 


Note: Transcription conventions are used following van Lier (van Lier, 1988). L represents 


the learner who delivers the report and narrates the CSs, and O1, O2, etc., represent 


the other participants. / / indicates overlapping conversation. = indicates that there is 


no gap between two turns. ? indicates rising intonation (not necessarily a question). 


Colons (::, etc.) indicate lengthening of the preceding sound. Double brackets such as 


((laugh)) indicate non-verbal actions or comments about the transcript. Three periods 


(...) indicate a pause of approximately one second. Refer to van Lier (1988: 244) for 


more details. 


Report “Spring Vacation” (3M24S) 


Turn | Speaker Utterances 

yappa haruyasumino hanashidayo (( Literal translation: “After all, the 

1 O1 talk about spring vacation”; Translation: “After all, the best topic is the 
spring break”)) 

2 O2 Hmm 

3 L so..., ... yeah how your vacation 

4 O1 let’s come on! 

5 All ((laugh)) 

6 O1 how was your vacation= 

7 L =vacation, uh..., ... [have..., ... um, 

8 All ((chuckle)) 

9 L um, twice a week, I have a part time job, 

10 O1 hm, 

11 L and, 

12 O1 which one 

13 L which one? 

14 O1 hm 

15 L what do you mean 

16 O1 kanmidokoro [the sweets shop] /or/- 

17 L /ah/ art (come) school and 

18 O1 oh 

19 L Japanese sweet shop, ..., /both/ 

20 O1 /both/. Oh 

21 L and...um...and...,...,er...,..., ..., | went to Osaka? 

22 O02 oh 

23 O1 Osaka? 

24 L yeah ... for a trip with my friends, and ... it was very fun 

25 Ol oh you went to ... /((unint))/ 

26 L /Uni/ ver /sal/ Studio, 
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27 O1 /yeah/ 
Japan, and ... er Tennoji ((place name)) Station, ((“Universal Studio(s) 
28 L : P 
Japan” is an amusement park located in Osaka)) 
29 Ol hm::/::/ 
30 L /to/ eat Takoyaki, ((chuckle)) ((Takoyaki is a speciality of Osaka)) 
31 O1 Takoya/ki/! 
/there/ is a shop, er, a good shop for Takoyaki, and I went to 
32 L 
Osaka Castle, /too/ 
33 Ol /bm/::: 
34 L Um 
35 O2 hm:: 
36 L it was very great 
37 O1 how was Universal Studio 
38 L hm? 
39 O1 how was Universal Studio 
Studio, oh::: it was, good but ... I... don’t think ... I wanna go again. 
40 L 
((chuckle)) 
41 O2 /hm:::/ 
42 O1 /um::/ so it’s not that much ... interesting place than Disneyland? 
43 L yes ((sic)) 
44 Ol no? 
45 L uh huh 
46 O1 huh. how come ..., ... 
47 L umis:, ..., ...,..., there was ..., ..., ..., the number of attraction, 
48 Ol um 
49 L is very ... 
50 O1 few 
51 L few, 
52. {Ol & O2} //bm:::// 
53 O2 is it scary? 
54 O1 ((unint)) 
no not so much, ... but ..., ... ah! ...,... d’you know (( or “did”)) 
55 L ; 
Jurassic Park? 
56 Ol huh 
57 L [ka:ts] er, ..., that attraction was ... coaster, 
58 O1 hm 
59 O02 oh 
60 L so ...,..., huh, a roller coaster ...,...,... 
61 O02? um 
62 L I hate that! 
63 All ((laugh)) 
64 O1 ..+) -.. $0, and what else did you do? 
65 L er::: and, ... this month, I’Il meet my friends, um::: ..., ... who are ... 
who are studying ... to entrance university? ... for one year 
66 O2 hm:: 
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because they couldn’t ... entrance university last year, um but they, ... 


67 L Ane 
most of them passed exam, to entra- enters university. /so/ 
68 O1 /hm:::/ 
69 O2? _ | huh. that’s great 
70 L um. good news. and how about your holiday 


71 O2 my holiday, ((chuckle) 


CS 1 “The Needle” (2M26S) 
Turn | Speaker Utterances 
mada nai? ((Literal Translation: “Not exist yet?” Translation “You 


oF haven’t come up with a story yet?”)) 
2 L yeah 
3 O02 oh, you have! 
4 L um? 
5 O2 your teeth ... took ((unint)) 
6 L ah! a so (“that’s right”)) ..., ..., /ok I/ have er::: 
4 O1? /you/ 
it is i- is was kowakatta (( “scary”)), abunakatta ((“dangerous”)) 
8 O2 
((laugh)) 
9 Ol abunakatta abunakatta ((laugh)) 


10 O1 de ((“and then”)) 
11 O2 awful /story?/ 
/(oh, oh:)/ ... ((possibly repeating the first syllable of “awful”)) er, 
I had operation, to: ... pull out my teeth, ((unint)) some teeth, and ... 
12 L after that, the doctor and um: ... assistants, were assistants were ... 


looking for something, oh what are they doing what are they doing 
((acting out)) 
13 O2 ((laugh)) 


I- I- er:: ... what’s ... the ..., ... , ... which they were looking for, was 
14 L ? ney 

er:: ... ((using a dictionary)) 
15 O1 er::: (laugh)) 
16 L sorry. oh ... needles ((found the word in the dictionary)) 


17 O02 oh 
18 Ol needle? 


19 L seey eeey ee ((Chuckle)) 

20 Ol what! 

21 L /needle/ 

22 O2 /after you/ ... you ..., ... put your teeth, 
23 Ol um 

24 L yeah 

25 L after yeah. after the operation, 

26 02? | um 

27 L they started to ... look look for needle 
28 O2 why /needle/ 

29 O1 /needle to/ sew you, 
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30 O2 what! 
31 L yes 
32 {Ol & O2| oh::: 
33 L oh /my god/::! ((acting out)) 
34 O1? /my god/::! 
35 All //(augh))// 
36 O2 you saw you saw your haguki ((“gums”)), 
37 L yes 
38 O2 DMS say svisisy: Reagucio’ 
39 L it was disppeared ((sic)) (laugh)) 
40 Ol oh my- 
41 L and finally,..., they... they couldn’t ... look uh find it 
42 O1 you- they couldn’ t? 
43 L yes (laugh)) (( means “no they couldn’t”)) 
44 O2 so what what happened to your /teeth/ 
45 O1 /yeah!/ yeah 
46 L no nothing nothing 
47 O1 so it’s just open? 
48 L What’s- the: ... needle was ... disappeared after my operation so ..., ... 
49 O1 after your /operation/? 
50 L /there is no problem/ for me 
51 Ol oh::: ok I see 
52 L but horrible (laugh))..., ... if the needle is in it my mouth, 
53 | OL &2 | ohz:::! 
54 O2 ok I /understand/! ((laugh)) 
55 O1 /(see)/! ..., ..., 
56 L um 
57 | O1 & 2? | buh::: 
58 L um. horrible. 
59 O1 I thought you need that needle to sew your /things up/ 
60 L no! /no no no/. ((chuckle)) 
61 O02 ok. 
ah na:nda. cho:bibittayo..., ..., ... ((““oh I finally got it. It scared me’”)) 
62 O1 
((laugh)) 
63 Ol ok 
64 O1 anything? 


CS 2 “The Banana” (2M30S) 


Turn | Speaker Utterances 
1 L There is a funny story, 
2 03? | um-hm. 
3 L um ... on the snow snowy day, 
4 |O03&4?|um. 
5 L a itsudakke((“when was that”)), a week ago? 
6 02-4 | ah::: 
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7 L we have a snow, 

8 02-4 | //yeah// 

9 L yeah. and I went to ... see a doctor? and I was walking a street, 
10 04 um 

11 L and ... there were a lot of snow, 

12 | 03 & 4? | //um// 

13 L and near the gasoline station, 

14 | 03 & 4? | /hm/ 

15 L gas station, there was something yellow thing, 
16 | O3 & 4? | //nm hm hm.// 

17 L (there) was, and I ... nani (“what do you call it”)) I approached, 
18 03? | hm 

19 L There was a banana ((chuckle)) 

20 All /Naugh// 

21 L sugoi umatteta ((“it was remarkable, buried”)) 
22 O nani sore:: (“what’s that”)) ((laugh)) 

23 All /Naugh// 

24 03 eh nande? (“but why?”) 

25 L kawaja nakute ((“it wasn’t just the banana peel”)) 
26 03 eh! just banana? 

27 All /Naugh// 

28 L and covered with snow 

29 All /Naugh// 

30 O03 ja ((‘then”))) someone coolered it then? 

31 All /Naugh// 

32 O02 frozen ba/nana/ 

33 L /laugh/ 

34 ? banana 

35 O3 oishii kedo ((“it is delicious but”))((chuckle)) 
36 O2 oishii ((“it is delicious”))((chuckle)) 

37 All /Naugh// 

38 L an the er bike 

39 All /Naugh// 

40 L came 

41 All /Naugh// 

42 L e, nani, hikutte ((what’s, “run over”?)) 

43 O03 run run over? 

44 L hiku? (“run over”?))((chuckle)) it’s run over? 
45 All /Naugh// 

46 L guccha ((onomatopoeia for a crushing sound)) 
47 All /Naugh// 

48 L no o mokugeki shichatta ((“I witnessed that”)) 
49 L kawaisoo ((“pitiful”)) pity ((augh)) 

50 All /Naugh// 

51 ? ah:: nande::? ((“oh why?”)) 
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52 O04? | somebody left 

53 O03 somebody left the banana 

54 O2 maybe it’s a child 

55 O03? | ah: 

56 O4? | ah sokka chibikko kamo ((“right, it could have been a little kid’”)) 
ah chibikkoka ((“yeah, a little kid”))..., ..., ... ah demo (“oh but”) I 

57 O03 . ; 
have slipped with banana’s,= 

58 O2 =oh:: 

59 03 ka- kawa((“peel”)), 

60 L ((laugh)) 

61 03 bananano kawade subetta kotogaaru((“I have slipped on a banana 
peel”)) tsuru! ((onomatopoeia for a skidding sound)) 

62 O4? | suberuno, hontoni? ((“does it really slip?”)) 

63 O03 um, slippy= 

64 L =and before the bicycle come, coming, ..., ..., I take you, take a photo. I 
took photo, 

65 ? um 

66 L by my cell phone. so I’ll show you. ((laugh)) later 

67 |O3 &L? | /Aaugh// 

68 03 banana 

69 All /Maugh// ..., ..15 e103 eee 

70 03? hez::: ...,... banana 
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Grammarization and Task Activities — 
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1. Introduction 

In the last few decades, language teaching methods have been observed 
to undergo a great shift from traditional grammar-oriented methods to more 
communication-oriented methods (see, for instance, Savignon 2001). 
Traditionally, teachers have taught language by providing long and elaborate 
explanations of the grammar rules following which their students have 
translated texts (Brown 2000). The findings of theoretical and empirical 
research, however, have indicated the importance of incorporating 
communicative perspectives into language teaching, regardless of whether it 
is in the context of English as a Second Language (ESL) such as in the U.S., 
or English as a Foreign Language (EFL) such as in Japan (Lightbown 2000). 

According to the Course of Study (The Ministry of Education 1998) and 
in congruence with and acknowledgment of the necessity of communicative 
methods of teaching, many language teachers in Japan have begun to employ 
more communicative activities in their classrooms. In other words, more 
teachers have recognized the need for teaching English in a more 
communicative context in order to enable students to use English among 
themselves when engaging in various activities (Takashima 2000). Despite 
the use of communicative methods, most students are unable to effectively 
communicate in English once they leave the classroom, and some cannot do 
so even in the classroom.’ While this is partly due to the limited exposure 
of learners to the target language (e.g., 3 hours per week at the lower 
secondary level and 4 to 5 hours per week at the upper secondary level), the 
students’ motivation to learn and use English is low, apart from doing so only 
to pass the written entrance examination. They merely desire the 
grammatical knowledge that is sufficient to decipher the long-winded 
English passages and answer the ensuing questions in Japanese. Therefore, 


' We refer to the students of standard public schools in Japan who are taught English three 


times a week for 50 minutes per class, with no other target language input unless these 
students seek it themselves. 
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although they know about the language, they do not know the language. 
Larsen-Freeman (2003) terms this situation as the “inert knowledge 
problem,” i.e., knowledge gained in the classroom remains inactive or inert 
when it is needed spontaneously in communication. 

In this paper, we propose two changes that are aimed at solving, at least 
in part, the inert knowledge problem commonly observed in Japan, 
particularly in public lower secondary (= junior high) schools. The first is 
to view grammar teaching not from the perspective of knowledge 
transmission but from the perspective of skill development. The second is 
to introduce activities known as “task activities” (abbreviated as TAs when 
necessary), which we believe, when used efficiently, can play a crucial role 
in facilitating the development of language acquisition skills.* In the 
following sections, we will first conceptualize the idea of “grammar as a 
skill,” since this view is different from the traditional view of regarding 
grammar as “static knowledge.” Based on this, in addition to the 
“communication activities” in which the particular structures per se are 
practiced, we will discuss the necessity of introducing activities that enhance 
understanding and thereby facilitate the learning of certain grammatical 
structures. Lastly, we will provide the rationale for integrating all these 
constructs when we consider the second or foreign language learning 
processes. 


2. Grammaring — grammar as a skill-acquisition process 

Even though the importance of fostering communicative competence is 
now emphasized (Canale and Swain 1980), teaching grammar — imparting 
knowledge in the form of explanations followed by oral practice — is still 
considered to be an essential component of language teaching (Widdowson 
1988). The definition of grammar is variable (see Celce-Murcia and 
Larsen-Freeman 1999). ‘Traditional or prescriptive grammar has a 
preoccupation with what is correct and with the establishment of a common 
usage of English among all students (Collerson 1994). In other words, the 
traditional view considers grammar as a set of rules or pre-established 
knowledge. It assumes that there is one predetermined correct answer, 
regardless of the context. However, this view of grammar does not appear 
to be appropriate when communication is taken into account, because real 
communication always takes place on-line in a particular context, and the 
correctness or appropriateness of the language used in that context must 


2 : ee 
We do not use learning/acquisition throughout the paper, except where necessary. In 


this paper, learning emphasizes the process, whereas acquisition emphasizes the state of 
learning. 
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always be considered.* For example, the question “how old are you?” 
asked by a doctor to his/her patient when filling a medical record is 
pragmatically appropriate, grammatically accurate, and semantically 
meaningful. On the other hand, if the same question is addressed to a 
stranger whom one may meet accidentally, it is semantically meaningful and 
grammatically accurate, but pragmatically inappropriate in that context. 
Correct language use thus varies according to the context. 

Therefore, it is clear that for communicative purposes, grammar should 
be viewed as an ongoing process rather than being viewed merely as static 
knowledge, and it should always be dealt with in a particular context. 
Derewianka (1998) states that grammar “... is concerned with the language 
choices available to construct different kinds of meaning and how these 
choices vary according to the social context.” Hence, the question that 
needs to be answered is how well can language teachers equip learners to 
choose suitable options for particular contexts in real language use. 

Larsen-Freeman (2001) coined the term “grammaring” to describe 
grammar not as a body of established knowledge in the traditional sense but 
as a “skill that has to be trained” — similar to other language skills such as 
writing or speaking. Thornbury (2001) also advocates this concept of 
grammaring and claims that grammar is more like a process and that it is 
“verb-like” rather than “noun-like.” More concretely, he explains that in a 
conversation, the speaker is choosing (consciously or unconsciously) the 
most appropriate and precise language with which to express his/her feelings, 
while the listener is choosing the most appropriate way of understanding the 
language choices made by the speaker. These on-line interactive choices 
are referred to as the “grammaring processes,” which every language user is 
simultaneously engaged in during communication. Language learners, 
second or foreign, need to learn the grammaring processes that are 
appropriate for particular contexts. The following narrative between two 
Japanese native speakers illustrates one such process.* 


Larsen-Freeman (2001) points out that grammar has three dimensions — form, meaning, 
and use. Accordingly, we make a three-way distinction when referring to correctness: 
when applied to form, we use grammatical accuracy (e.g., *I go to Tokyo yesterday.); 
when applied to meaning, we use meaningfulness (e.g., *Colorless green ideas sleep 
furiously.); and when applied to pragmatic use, we use appropriateness. 

Larsen-Freeman (PC) limits grammaring to only those sentences that are grammatically 
correct. However, we believe that grammaring should be applied to all sentences, 
grammatical or ungrammatical, since they are a product of the learners’ grammar 
systems. 
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Narrative example 
Shizuka: When will you get married? 
Akihiko: When? Idon’tknow. Maybe ... uh ... after thirty. 
Shizuka: Thirty? 
Akihiko: Yeah, after thirty. “ll get marriage — PI] get married ... 


(3 turns) 
Akihiko: ... then if I fall in lover [sic] with her, Vl get marriage with her ... 
(11 turns) 


Akihiko: And ... uh ... when I saw her. “liked to get married to a Chinese girl 
because she’s so beautiful. 
(Gass & Varonis 1989, in Gass and Selinker 2001; emphases added) 


This exemplar shows how grammaring occurs, moving from an 
inaccurate form (1) to an accurate one (2), back to an inaccurate form (3), and 
then to an accurate one (4) again. This progression coincides with the idea 
of the U-shaped growth suggested by Kellerman (1985). What appears to 
be regression is actually a necessary step backwards so that a move toward a 
robust and effective system can be constructed. 

Another example, a conversation between a teacher and a student, is 
presented in order to clarify the situation in which grammaring is applied to 
grammatical sentences. 


Teacher: What are you doing, Yukiko? Have you done your homework yet? 
Yukiko: Look at this! Ive done it already. I did it a long long time ago. 


In this exemplar, in response to the teacher’s question (5), the learner 
uses the present perfect (6) and the past tense (7) forms appropriately as well 
as accurately, depending on how the learner perceives the homework in the 
present and the past, i.e., grammaring. 

In this section, we have claimed that grammar should not be regarded 
simply as static knowledge but should be considered as a process in which 
that knowledge is being applied according to the context. This application 
process is termed as “grammaring” — grammatical knowledge being applied 
on-line according to the context. 


3. Grammarization — the product of grammaring 

The concept of “grammarization” was proposed by Takashima (2001) to 
complement the idea of grammaring and to refer to both grammatical as well 
as ungrammatical sentences. He claims that if grammaring is a process 
wherein the learner has to develop him/herself with the help of the teacher, a 
corresponding construct termed as grammarization is needed to represent the 
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product of grammaring at a certain point in learning. In other words, the 
end product of grammaring needs to be measured against a specific objective 
to check whether or not it has been learned. To put this differently, 
grammarization represents the grammaring process frozen at a certain point 
in time during the language learning process. Since grammaring as a 
process cannot be observed, grammarization is the only means to check the 
gap between a learner’s language development and a target level or the 
extent of progress made by a learner in terms of grammaring. By observing 
such grammarization, the teacher can provide useful feedback and/or 
feed-forward to the learner, thereby nurturing the development of 
grammaring in the future. 

In the narrative example given in the previous section, as a result of 
grammaring, at a certain point, the inaccurate “agrammarized” form — “T’ll 
get marriage” — and, at another point, the accurate grammarized form — 
“T’ll get married” — appear through correct grammaring. Figure 1 
illustrates this phenomenon from both chronological and synchronic 
viewpoints. It depicts the manner in which grammaring occurs over time, 
resulting in both agrammarized inaccurate and target-like accurate sentences 
being produced each time. 

Thornbury (2001) states that grammaring “is a kind of organic process 
that, in the right conditions, grows of its own accord and in its own 
mysterious way.” In other words, grammaring cannot be taught but can 
only be developed or facilitated under the “right conditions” over time; the 
creation of these conditions is an important responsibility of the teacher (see 
also Nunan 1999). 

In the following section, a communication activity known as a TA 
aimed at facilitating grammaring is proposed. This introduction is 
inevitable particularly in the Japanese context in which increased input, 
interaction, negotiation of meaning, and feedback are essential in promoting 
grammaring, i.e., grammatical accuracy, as well as in providing more 
opportunities to use English in the classroom. 
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Grammaring | — I’ll get married (target sentence) 

r 
9 | 
g Grammaring | — (I liked to) get married (TL-grammarized) ...(4) 
o ee 
2. 
2 Grammaring | — I’ll get marriage (agrammarized) ...(3) 
2 we 
5 Grammaring | — I’ll get married (TL-grammarized) ...(2) 
=. 
ga 


i 


Grammaring 


— J’ll get marriage (agrammarized) ...(1) 


Synchronic Grammaring = Grammarization 


(Takashima and Sugiura 2002) 


Figure 1. Grammaring Through Chronological and Synchronic Viewpoints (The numbers 
in the circles correspond to those in the narrative example on page 4.) 


4. How to facilitate grammaring 

Based on the idea that grammaring is a skill that needs to be developed 
under the right conditions, the teacher should be concerned about how the 
development of this skill can be facilitated. One way to achieve this is to 
engage the learner in the target language behavior in a situation that 
simulates the real world and through which he/she practices grammaring. 

In the field of second language acquisition, employing “tasks” in the 
classroom has gained the increasing interest of researchers (e.g., Nunan 
1989; Long and Crookes 1992; Willis 1996; Skehan 1996, 1998; Bygate, 
Skehan, and Swain 2001; Ellis 2003). Although different researchers have 
defined the concept of a task differently, Ellis (2003) provides the following 
comprehensive definition of a task: 


(1) A task involves a primary focus on message conveyance. 

(2) A task involves some kind of gap. 

(3) A task allows learners to select the linguistic resources they intend to use. 
(4) A task has a clear outcome other than the display of language. 


A rationale for the use of tasks can be drawn particularly from the 
extensive research on focus-on-form, which is characterized as the 
approaches that comprise “an occasional shift of attention to linguistic code 
features — by the teacher and/or one or more students — triggered by 
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perceived problems with comprehension or production” (Long and Robinson 
1998: 23). It is assumed that tasks can be a useful means to practice the 
focus-on-form approach, i.e., to draw a learner’s attention to the linguistic 
features while she/he is primarily engaging in meaningful communication. 
In this approach, as Larsen-Freeman (2003) discusses, the linguistic features 
to be focused on are form, meaning, and use, and the degree of effectiveness 
of the focus-on-form approach depends on the level of integration of the 
learner’s attention to all the three aspects of linguistic features (Doughty and 
Williams 1998: 245). As discussed in Section 2, grammaring, which we 
would like to foster in the learner, is a skill that should embrace all the 
aspects of form (correctness), meaning (meaningfulness), and use 
(appropriateness) in a particular context. Thereafter, tasks can prove to be a 
useful means to improve the grammaring skill. 

Most importantly, tasks can simulate the real world in a classroom. 
However, in terms of providing opportunities for the interaction and 
negotiation of meaning in a particular context, some warnings have been 
served against the employment of tasks in the ESL context. Seedhouse 
(1999) claims that task-based interaction tends to minimize language forms 
in order to accomplish the completion of a task, while Skehan (1998) points 
out that the learner may rely too heavily on communication strategies for the 
sake of communicative effectiveness. Furthermore, tasks designed for use 
in ESL contexts are often too demanding for EFL learners (examples of such 
tasks are provided in Foster and Skehan 1996: for instance, a speaker is 
requested to ask his/her partner to check the oven in the speaker’s room). 
Hence, directly introducing such tasks in Japanese public schools will prove 
unsatisfactory not only with regard to the language level of the learners but 
also from a practical viewpoint. Most learners have not even reached a 
point where they can rely on communication strategies. 

Given the drawbacks of employing tasks in the EFL context, in order to 
foster grammaring, Takashima (2000, 2005) has proposed employing TAs, 
which are somewhere between tasks and “task-oriented activities” (TOAs 
hereafter), with the latter being commonly used in Japan. TAs are designed 
to induce learners at some point to use one particular structure selected from 
among other similar structures to best suit the given context, while 
maintaining their primary focus on the message and completion of the 
activity.” In contrast, TOAs are designed to allow learners to practice and 


> According to an existing argument, creating a production task that makes specific 
structures essential for its successful completion is difficult (Loschky and Bley-Vroman 
1993). However, we have found that native speakers frequently use specific structures 
when engaging in task activities. Thus, we are convinced that although it is difficult, it 
is possible to create task activities that induce the use of targeted structures. 
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learn a particular structure, which is usually taught immediately before the 
activity itself. These are usually accompanied by a model dialogue and/or 
the vocabulary and structure(s) to be practiced and learned in a meaningful 
context. In the following section, the TA is contrasted with the TOA, and 
both are discussed along with concrete examples. We conclude that in 
order to enhance grammaring, TAs need to be practiced together with TOAs 
in a congruent manner. 


5. TAs and TOAs in the classroom 
According to Takashima (2000: 36), the following are the necessary 
(although not exhaustive) conditions that TAs must meet. They should 


(1) require completion, 

(2) be message-focused, 

(3) involve a negotiation of meaning, 
(4) entail a comparison of structures, 
(5) contain an information gap, and 
(6) be of interest to the learners. 


While TAs play a crucial role in fostering grammaring, other language 
activities that provide the necessary basis on which TAs can be practiced are 
also essential for the learners. In addition to TAs, four other major 
classroom activities can be identified: Drills, Exercises, TOAs, and Tasks. 
TOAs and tasks have already been referred to in the previous sections. 
Drills are basically the pattern practices of a word or a sentence, while 
exercises are activities that are designed to practice the manipulation of 
grammatical rules in sentences. These five activities are represented on the 
continuum shown in Figure 2. They cannot be strictly categorized since 
they vary in their orientation and degree of emphasis toward code (or form) 
or message (or meaning). 

At this point, it might be countered that the introduction of the TA has 
simply made activities more complex; it is believed that the mere provision 
of tasks to students should be sufficient for them to use English in the 
classroom. However, as previously mentioned, most Japanese learners 
tend to be categorized as beginners, or false beginners, particularly in oral 
speech, and therefore, their training requires a stepwise approach to deal 
with the situations where they must use particular structures comparatively 
in the context given in the TAs. This is a modified pedagogical task in a 
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task-supported in contrast with task-based language teaching situation.° 
This idea fits perfectly in a language learning context that presents few 
opportunities for more authentic communicative experiences, as is the case 
in Japan. 

Below the continuum, the three stages of the presentation, practice, and 
production (PPP) approach have been depicted. A typical procedure of this 
approach generally begins with an explicit or implicit presentation of a 
focused grammatical item, which aims at the development of grammatical 
knowledge. This is followed by the practice stage, which provides 
opportunities for the learners to use the knowledge that they have acquired at 
the presentation stage. Finally, at the production stage, the learners are 
required to produce utterances more freely and spontaneously than at the 
practice stage. Despite its popularity among teachers and the fact that it is 
still being practiced all over the world, this approach has been criticized as 
being incompatible with the findings of second language acquisition research, 
which state that the learner does not simply learn one grammatical item after 
another (e.g., Skehan 1998; Scrivener 1996). 

However, the traditional PPP approach is appropriate when the 
production stage is interpreted as the final stage that roughly corresponds to 
the continuum where the TAs and tasks are placed. It is a misinterpretation 
to state that the final production stage is equal to the TOA level. The 
production stage does not require students to speak out loud what they have 
learned, but rather, it requires them to display their creativity. 


® Some terminological definitions are required at this point. A pedagogical task, 
according to Nunan (2004: 4), is “a piece of classroom work that requires learners to 
comprehend, manipulate, and produce or interact in the target language while their 
attention is focused on mobilizing their grammatical knowledge in order to express 
meaning, and in which the intention is to convey meaning rather than to manipulate 
form.” In task-based language teaching (TBLT), tasks “function as a useful device for 
planning a communicative curriculum,” while in task-supported language teaching 
(TSLT), they “are seen not as a means by which learners acquire new knowledge or 
restructure their interlanguages but simply as a means by which learners can activate their 
existing knowledge of the L2 by developing fluency. This is clearly a lesser goal for 
tasks, as they do not replace exercises.” (Ellis 2003: 30) 
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Communication Activities 


More Form-Focused More Message-Focused 


_ 1] P(Presentation) 7 = P(Practice) [| P(Production) 


(Takashima 2005: 8) 
Figure 2. Five Major Language Activities in the Classroom 


In many language classes, TOAs appear to be predominantly employed 
under the guise of communicative teaching. Although TOAs should be the 
activities at the practice stage of the continuum, they sometimes appear to be 
mistakenly considered as the activities at the production stage. While they 
may be useful for familiarizing learners with the target language, they cannot 
be expected to assist grammaring. This is shown in the following TOA 
example that presents a conversation between two Japanese students who are 
practicing the target present perfect form: 


Hiro: Have you watched Star Wars Episode II? 
Makoto: Episode II? Yes, I have. 

Hiro: How was it? 

Makoto: It was thrilling. 

Hiro: Oh, thrilling. 

Makoto: Have you watched Rasen? 

Hiro: Yes, I have. 

Makoto: How was it? 

Hiro: It was scary. Very scary. 


This TOA was created for learners to use the present perfect form 
following a model dialogue.’ The utterances employed show that the 
learners certainly exchanged meaning and practiced the present perfect form, 
maintaining a smooth flow throughout the model dialogue. However, 
TOAs such as this one do not facilitate grammaring, since they do not 


” The following model dialogue was presented to the learners in the TOA. 


A: Have you watched “Harry Potter”? 
B: Yes, I have. 

A: How was it? 

B: It was interesting. 
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require learners to choose a particular structure from among other similar 
structures (in this case, present or past verb forms). A TOA is created with 
the aim of enabling the learners to use a particular structure in a particular 
context. Therefore, the learners are not required to engage in grammaring, 
but are simply required to apply grammatical knowledge. Thus, they do not 
use their knowledge on-line in a grammaring process. This point can be 
further clarified by analyzing another conversation between the same two 
Japanese students engaging in a TA after performing the TOA presented 
above: 


Example conversation in a TA 
Hiro: Uhn, Makoto, where do you want to go, go? Ah, I have three idea. 
One I want to go United of nani Kingdom, England. (= Makoto, where 
do you want to go? I have three ideas. One is to go to the United 
Kingdom, i.e., England.) 
Makoto: It’s... 
Hiro: I watch, watched Stonehenge on TV yesterday. And I, I have studied 


English for nine years, I want to talk English. (=I saw Stonehenge on 


TV yesterday. As Ihave studied English for nine years, I’d like to use 


English in England.) 

Makoto: Oh. 

Hiro: Ah, second is Egypt. 

Makoto: Egypt? 

Hiro: Ihave, I have read a book about Pyramid. (= I have read a book about 
the Pyramids.) 


In the above conversation, the students were trying to choose the most 
appropriate linguistic forms to express their intended meanings by drawing 
on the knowledge practiced in the TOA. For instance, Hiro appropriately 
used the past, present perfect, and present tenses in her second and fourth 
turns. When she rephrased her sentences such as “I watch, watched...” and 
“And I, I have studied...,” she was selecting the most appropriate tense for 
the meaning that she wanted to convey. Although the grammaring process 
cannot be directly observed, we can observe it indirectly through her 
utterances that constitute grammarization (i.e., her production). While 
engaging in the TA, Hiro was struggling to convey the intended meaning in 
English, seeking the appropriate forms and choosing one form over another. 
In other words, she was engaging in grammaring. 

The skill-acquisition theory suggests that a skill can be acquired by 
engaging in the target behavior of using language, while maintaining the 
relevant knowledge in the working memory (DeKeyser 1998). Thus, in the 
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TA, Hiro was engaging in the target behavior by temporarily drawing on the 
grammatical knowledge that she had learned and practiced in the TOA, 
which is what the theory claims. 

With regard to the advantages of engaging learners in TAs, Sugiura 
(2002) reviewed the empirical research investigating the effects of TAs as 
compared to those of TOAs. All of the reviewed research compared the 
improvements in the scores obtained on written tests in terms of the targeted 
structures, while some also considered the scores of the oral tests. The 
results convincingly indicated that, overall, the groups that used TAs were 
more accurate in identifying the targeted forms in the written tests as 
compared with the groups that used only TOAs. In addition, it was reported 
that after engaging in the TAs, the learners had developed a more favorable 
and positive attitude toward using the target language. With regard to the 
oral tests, the improvements in accuracy as well as the appropriateness in the 
use of the target forms were observed qualitatively, although statistically 
significant differences were not found. 

Based on the research findings, employing TAs with TOAs compared 
with using TOAs alone produced three positive effects (Sugiura 2002): 


(1) _ increased understanding of the targeted forms; 
(2) more accurate and appropriate use of the targeted forms; and 
(3) more positive attitude toward the use of the target language. 


The lack of statistically significant differences in the oral tests is 
probably due to the difficulties of assessing the effect of a treatment after 
only a few classes and designing appropriate pre- and post-tests to measure 
the effect. Language learning is not a simple cause and effect process, and 
measuring oral proficiency is more complex than measuring understanding. 
Adapting ideas from the science of chaos/complexity, Larsen-Freeman 
(1997) rightly questions, “How will we know if our treatment is an ordinary 
pebble, or an avalanche trigger?” Thus, it may be premature to conclude 
the further effects, if any, that TAs have on language learning without 
conducting a more longitudinal and qualitative research. However, having 
determined that TAs do contribute to grammaring, we should attempt to 
combine the three major elements discussed above — grammaring, 
grammarization and classroom activities. 


6. Grammaring, grammarization and classroom activities 

We have noted that grammar has two facets — grammar as knowledge 
and grammar as a skill (i.e., grammaring). In addition, we have proposed 
that the concept of grammarization as a product of grammaring is necessary 
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to capture the learner’s grammaring progression. In Table 1, we have 
illustrated the relationships of grammar as knowledge with grammaring and 
grammarization in terms of classroom activities (Takashima and Sugiura 
2002). The process view of grammar (grammaring) is contrasted with the 
product view of grammar that considers grammar as a set of rules and 
knowledge. From these two perspectives, the table presents the object of 
teaching, the roles of the teacher, the object of assessment, and the best 
means of assessment. 

The product view of grammar aims to teach knowledge that conforms to 
the target norms, whereas the process view of grammar aims to facilitate a 
skill with which the learner can use language appropriately, accurately, and 
meaningfully depending on the context. Although the two views are 
different, both are equally essential in language learning/acquisition. The 
teacher should construct a syllabus or curriculum taking both these views 


into consideration. 


Table 1. 


Grammarization and Classroom Activities 


In this regard, TAs and TOAs are useful vehicles. 


A Comparison of the Different Views of Grammar in Relation to Grammaring, 


Views of grammar 


Features of the views 


“Product” view of grammar 


“Process” view of grammar 


Object of teaching 


Se2hteiieS Grammars. 
product 
knowledge 


a set of rules 


Grammaring 


process 
skill 


principles of language use 


Teacher’s role 


transmitting grammatical rules 
and training the learner for 
target-like language use 


providing appropriate conditions 
to facilitate the learner’s language 
development 


Object of assessment 


soe seseeees Grammar 
product 

accuracy and meaningfulness 
measured for conformity against 


the target norms 


ere Sra OT as 
product of grammaring 

accuracy, meaningfulness, and 
appropriateness in individual 


learner’s language use 


Best means of assessment 


provide opportunities to apply 
the learned knowledge 


create social and _ linguistic 
contexts to use the skill of 


grammaring 
e.g., TOAs e.g., TAs 
(Takashima and Sugiura 2002) 
7. Conclusions 
After emphasizing that grammar has two facets — grammar as 


knowledge and grammar as a skill — the latter being termed as grammaring, 
this paper has discussed the concept of grammarization as a product of 
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grammaring to capture the grammaring progression of learners. As 
Thornbury (2001) points out, the indispensable role of the teacher is to 
provide the optimal conditions for the growth of the learners’ grammaring. 
These optimal conditions imply a context in which these learners can use 
language creatively and an atmosphere in which they feel the need to use the 
target language. In addition, appropriate feedback should be provided at 
opportune times. 

The conditions that facilitate grammaring will be different for different 
learners. Only through careful observation of their students’ grammarization 
can the teachers know the feedback and feed-forward to be provided to them 
in order to create the most desirable conditions for grammaring in the future. 
On the basis of both the theory and empirical research, we propose that it is 
possible for TAs to facilitate this grammaring, which will eventually lead to 
target-like grammarization. We believe that, in EFL conditions, by 
employing TAs and not tasks, teachers can capture their learners’ 
grammarization and provide them with accurate feedback. 

In this paper, we have been more concermed with the processes involved 
in the grammatical accuracy of language acquisition. Further studies 
investigating these grammaring processes on-line, that is, investigating the 
performance analysis of TAs, are necessary to attain a better understanding 
of the mechanisms involved in acquiring a second language. For instance, 
the extent to which the task difficulty affects the amount of negotiation of 
meaning on-line needs to be examined. Further related studies are 
definitely necessary in this EFL context. 
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A Computerized Test of Oral Proficiency 
— Development of an Automated 
Instrument — 


Lynne HANSEN and Joshua ROWE 


The work reported in this chapter is motivated by the practical need to 
efficiently and objectively assess the second language speaking ability of 
large numbers of people. The assessment method described to fill this need 
grew out of two lines of previous research 1) studies of temporal variables in 
speech and 2) recent advances in automatic speech recognition (ASR), which 
together suggest means for the development of efficient and objective speech 
evaluation. 

Numerous studies of temporality in speech have been published over a 
span of four decades (for reviews, see Zellner, 1994; and Kenny, 1996). 
Beginning with Goldman-Eisler’s (1968) pioneering research on pause 
phenomena in L1 English, the early work helped to establish the association 
of pauses in speech with the planning of the succeeding utterance. In 1978 
the field of “pausology” became established with an interdisciplinary 
conference held in Kassel, Germany (Dechert & Raupach, 1980). Subsequently 
studies of hesitation phenomena in the speech of L2 learners have also 
appeared, attempting to relate the temporal variables to the development of 
proficiency as a second language is learned (Lennon, 1990; Riggenbach, 
1991; Riazantseva, 2001). Since the mid 90’s, reports have appeared of 
attempts to relate temporality in L2 speech to changes in proficiency as a 
second language is lost (Nakuma, 1997; Russell, 1996, 2002; Tomiyama, 
1999; Yoshitomi, 1999. For a review, see Hansen, 2001). 

Both second language acquisition and attrition are examined in our 
studies of temporality in the L2 speech of L1 English speakers who had 
learned Japanese as young adults in Japan (Hansen et al., 1997, 2000, 2002), 
We measured hesitation phenomena in three sets of Japanese storytelling 
data: acquisition data from learners in Japan, attrition data from returnees 
back in the US from the same population, and control group data from native 
speakers. As in all of the studies cited above, the temporal variables (e.g., 
silent pause, L1 filler, L2 filler, universal filler, total talk time, speech run, 
laughter) were measured and recorded manually, a laborious and 
time-consuming process. Although the results of the work on temporality, in 
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our study and others, reveal significant relationships between the timed 
variables and scores from oral tests, the resources required for such manual 
measurement seemed initially to preclude any immediate practical 
applications to language testing. 

Recent advances in speech recognition technology (ASR), however, 
have made possible such applications to the assessment of speaking. Several 
researchers have attempted to develop methods for rating pronunciation 
using ASR (Bernstein et al., 1990; Cucchiarina, et al., 1997, 2000; Franco, et 
al., 1997; Neumeyer, et al. 1996; Suzuki et al., 2005). Analyses first of read 
speech and then of spontaneous speech have suggested that automatically 
obtained measures of temporality are significantly related to scores of 
pronunciation quality assigned by trained human raters 

In one application of speech recognition technology, a private testing 
company, Ordinate (1998), developed an automated oral English test, 
PhonePass’™, which is administered over the telephone with a computer. 
Speech rate is a major assessment component of this test which is said to 
measure facility in spoken English. Diagnostic sub-scores are given for 
reading fluency, listening vocabulary,reciting/pronunciation, repeat accuracy, 
and repeat fluency. 

Encouraged by these developments in the automated testing of oral 
proficiency, we worked to develop a computerized instrument to measure 
temporality in speech that could be economically administered in educational 
settings (see methods section for details). 

In addition to having the capacity to measure timed elements in speech 
samples that are elicited by the instrument itself, the program also allows for 
the analysis of independently entered speech samples. Thus, we were able to 
enter our L2 Japanese data in order to compare the manual measurement of 
temporality using SoundEdit, reported in our previous studies cited above, 
with the automated measurement of the same data. The two analyses yielded 
similar results. Slight differences between them in some instances were 
found to be due to human error in the manual measurement. The program did 
better in a few seconds what it had taken hundreds of hours for student 
researchers to do by hand.(Hansen, et al., 2002). 

With the prototype of the new testing instrument completed, we set out 
to answer the following questions: (1) Can the automated speaking test be 
efficiently administered in a university ESL program? (2) Are scores from 
the test predictive of the students’ placement in their ESL speaking classes in 
the program? (3) How does assessment of SPEAK tests by the automated 
test compare with the ratings of expert human raters? 
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Methods 
Subjects 

The 210 subjects who took the automated speaking test were enrolled in 
English speaking classes in a university ESL program in Hawaii. These 
college students had each been placed in one of three English proficiency 
levels for their speaking class: Level 1 (Foundations), n = 38; Level 2 
(Intermediate), n = 59; or Level 3 (Advanced), n= 113. The subjects came 
from diverse cultural and linguistic backgrounds, the majority of them from 
the Asian rim and the South Pacific: Japanese, n = 47; Chinese, n = 42; 
Korean, n = 39; South Pacific, n = 32; and Other,n = 45. The control group 
consisted of 16 native speakers of English, students at the same university. 


Instrument 

The computerized speaking test was designed to collect demographic 
information, elicit and record speech samples, and compute fluency scores 
based on analyses of those samples. After a simple welcome screen, the 
test taker is prompted to fill in some personal information. Then he/she 
watches a series of silent video clips containing scenes of student life, 
embellished with slapstick humor. Each clip is three and a half minutes in 
duration and presents more than enough information for the students to speak 
about. After each clip, still frames of the action are presented to remove the 
challenge of having to remember the events, together with prompts to elicit 
the speech. In addition to the English speech sample, an L1 sample was 
collected and scored separately. For the analysis, the program breaks down 
the wave files of the speech samples into key temporal variables. The total 
score is calculated using an algorithm based on the strength of relationships 
of the respective variables that have been found in correlations with 
measures of speaking ability. 

As pointed out above, in addition to being able to analyze the speaking 
samples elicited by the test, the program can analyze recorded speech 
samples which are independently entered. This was done with SPEAK test 
data which had been assigned ratings by official raters for the purpose of 
comparing the official scores with our automated test scores. For this 
SPEAK test comparison, the formula used to calculate the machine scoring 
was calibrated to the strengths of the relationships between the temporal 
variables and the rater assigned scores. 


Data Collection and Analyses 

The test was installed on computers in a computer lab on the university 
campus, and each subject was provided with a headset with a voice 
recognition microphone which utilized background noise cancellation 
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technology. The particular computer lab was chosen because it was one 
which the students are not ordinarily given access to, in order to help 
facilitate quiet in the testing environment. The subjects came to the room 
in groups of 10 to 12 and each was assigned to a computer that was spaced 
apart from others to the extent possible. A brief orientation was given 
concerning the nature of the test and the use of the microphones for 
recording the spoken responses. Then each of the students selected and 
opened the testing program using the mouse, and independently completed 
the test. The recorded_wavefiles of the student responses and their scores 
were sent to a database that had been set up for the project, and the scores 
were automatically e-mailed to the students’ teachers. A one-way ANOVA 
analysis by proficiency level was used to relate performance on the test to 
student placement. 

The SPEAK test is a widely used test of English speaking ability that is 
put out by the Educational Testing Service (ETS, 2001). For the comparison 
of the computerized instrument with this test, we used the speech samples 
from 14 SPEAK tests that had been rated at ETS by official raters. Having 
determined that only responses of at least 60 seconds yielded reliable results, 
we eliminated the 5 samples from among the 12 samples on the test that 
were less than a minute in duration. This left us with 7 speech samples for 
each of the 14 tests. These samples were entered into our program for the 
automated analysis. Finally, the scores from this analysis were correlated 
with the scores that had previously been assigned by the human raters. 


Results 

Addressing the first research question, on whether or not the test could 
be efficiently administered in an ESL program, our answer is in the 
affirmative. The answer for any particular program, of course, would 
depend upon having the technical capability, and the willingness of faculty 
and students. In general, however, we conclude that the test is relatively 
easy to administer to large numbers of test takers. The testing time for 
students varies according to how long it takes to enter in the demographic 
information and collect their thoughts after each video clip is viewed, but 
most in our pilot administration took between 15 and 20 minutes. The 
process of test administration is simple and requires little monitoring of the 
students. The automated speaking test appears to have the potential to save 
faculty time that is currently used for placement interviews or other speaking 
evaluations. 

In order to address the second research question concerning the 
correspondence of speaking class placement to temporality in the students’ 
speech, we ran one-way ANOVAs of the temporal variables by proficiency 
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level. In addition to the English data which we expected to relate to 
placement, as a comparison, we decided to run ANOVAs on the mother 
tongue data, which we expected would not relate to the students’ placement 
in an English class. As shown in Table 1, below, the expected results were 
obtained. 


Table 1. Relationships of ESL Speaking Class Level to Temporal Variables in English and 


L1 Speech 
English Mother Tongue 
Temporal Variable F sig. F sig 
Silent Pause Time 6.91*** = 001 .082 921 
Silent Pause Length 4.84** .009 .157 855 
Total Run Time 6.72** .002 .107 .896 
Length of Run 3.60* .030 .168 843 


*p<.05~ **p<0l~ ***p<.001 


We see here that temporality in the students’ L1 has no relationship 
whatever to the placement they had been assigned in their English speaking 
class. On the other hand, the hesitation phenomena in their English speech 
do relate significantly to their ESL speaking placement. Notice also on the 
table that the temporal variable with the strongest relationship to the 
placement level is Silent Pause Time (F (2, 210) = 6.91. p < .001). 

We turn now to our final question concerning the comparison of the 
assessment of speech samples by the automated test with the ratings assigned 
to them by the expert human raters. The Pearson product moment 
correlations showed that the r values for the following variables were 
statistically significant: Silent Pause Number, r = -.47, p < .05; Total Time 
of Silent Pause, r = -.80, p< .05; Average Length of Silent Pause, r = -.58, 
p < .05; Number of Runs, r = -.47, p < .05; Total Run Time, r = .70, p < .05; 
and Average Run Time r = .67, p < .05. The results of the correlations 
between these temporal variables and the SPEAK test score were used in the 
calibration of the equation for the automated measurement of the speech 
samples. 

Using the resulting equation for the automated measurement of the 
SPEAK test data, the correlation between the machine and the human ratings 
was .988, p = < .01. The scores, in other words, were nearly the same. 
The comparison between the fluency scores determined by the human raters 
and the scores from the machine measurement was used in the fine-tuning of 
the equation for the automated program. That new equation is currently 
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being used in the second generation of the instrument which is now being 
tested. 


Conclusions 

In the ongoing piloting of the test, the optimum equipment for the 
recording of speech samples is being sought. While the majority of students 
appear to be familiar with the microphone headsets, there are still a good 
number who have to be individually instructed on their use, and the 
recording quality could be better. In addition to quality, the file size of 
recordings is being examined and adjusted in an attempt to determine the 
most efficient means for producing a viable sample. The size may be 
important when testing large numbers of students because of the potential 
limitations of hard drive space for saving the samples before they can be 
written to CD or DVD. 

The results of the data collection so far suggest that, at least in the short 
term, this automated oral assessment could offer an economical test that is 
predictive of the speaking ability of language learners, even though its 
construct validity is not yet firmly established. For the longer term, the 
developing instrument promises to contribute to continued improvement in 
the speech recognition technology used in language testing, and ultimately to 
a new generation of automated speaking tests that will be able to support 
stronger construct claims. 

In addition to educational contexts, several research applications of the 
technology are apparent as well. Researchers interested in oral language 
development can efficiently obtain measurements of temporal variables in 
subjects’ speech samples, as well as a speaking score from the algorithm 
derived from the SPEAK score correlations. Analyses can be done of 
speech samples elicited by the test, or speech samples independently 
collected by the researcher. Similarly, the technology could be used by 
language attrition researchers to track the loss of fluency as languages are 
forgotten. 

A further research application is suggested by Riazantseva”s (2001) 
study of pause phenomena in the speech of Russian learners of English. 
Her comparison of temporality in the speech of native speakers of these two 
languages shows significant differences between them in pausing 
conventions, and the differences come into play in the process of L2 
acquisition. The lower proficiency learners in the study maintained L1 
pause behavior in their L2 English while the more proficient learners had 
acquired the ability to accommodate to L2 conventions of temporality. 
Clearly, comparisons of temporality in world languages can potentially 
contribute to our understanding of the processes involved in the L2 
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acquisition of paralinguistic features. The ASR technology described here 
could facilitate comparative studies of temporality between first and second 
languages. 
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1. Background 

The last few decades have witnessed the mushrooming of proficiency 
scales to measure foreign language ability. Some of these scales include the 
ACTFL (American Council on the Teaching of Foreign Languages) 
Proficiency Guidelines (ACTFL PG), the Canadian Language Benchmarks 
(CLB), and currently, the most widely accredited, the Common European 
Framework of Reference for Languages (CEFR) (Council of Europe, 2001). 
Hudson (2005: 205) states that “these developments have been motivated by 
a perceived need to produce test results that are more transparent than has 
traditionally been the case.” Language test users around the world have 
begun to express their dissatisfaction with a simple, numerical type of 
feedback of their test results. Instead, they demand more qualitative 
descriptions of their language proficiency or ability. Such descriptions, 
which are behaviorally based and attempt to describe proficiency according 
to “real-world” performance in specific contexts, are “felt to be much more 
informative than the mere numbers that many learners are used to getting 
from language tests” (Huhta and Figueras, 2004: 74). 

With regard to the CEFR, Heyworth (2004: 17) states that “the words 
‘intermediate’ and ‘advanced’ are vague terms which tend to be interpreted 
very differently in different contexts” and that “the scale from Al to C2 
makes level statements more transparent, and the scale is increasingly being 
used to describe and compare levels both within and across languages.” It 
should be noted, however, that although these scales are designed to provide 
learners with a “common” framework of reference in its intended region, 
they are, in actuality, “local” from the global perspective. Hence, at present, 
it may not be easy to apply one framework to different contexts. The 
purpose of the CEFR, which is designed for use across the different 
languages of Europe, is threefold: (1) to promote and facilitate cooperation 
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among educational institutions in different countries; (2) to provide a basis 
for the mutual recognition of language specifications; and (3) to assist 
learners, teachers, course designers, examining bodies, and educational 
administrators to situate and coordinate their efforts. The above functions 
have arisen from the specific needs of EU countries and may not be 
applicable per se in other parts of the world. The rapid globalization that is 
currently in progress necessitates a “universal” framework that can be 
applied to virtually any context. This may not be an easy task since what 
we are interested in is extremely complex in nature. 

The purpose of this paper is to report the process that we adopted to 
develop a proficiency scale for Japanese high school students based on a 
well-established English proficiency test—Global Test of English 
Communication for Students (hereafter, GTEC for STUDENTS)—and to 
then relate this scale to the CEFR in order to determine its universality and 
particularity as a language proficiency framework. These scales were 
developed independently, using different procedures. In the development 
of the CEFR, the scale level descriptors were created by using the 
multifaceted Rasch model analysis, which was used to calibrate the 
descriptors. This is the primary difference between the CLB and the 
ACTFL PG The GTEC scales are also empirically based; however, they 
are created on the basis of scores from a specific test and its accompanying 
questionnaires. 

For our research, we used the GTEC for STUDENTS, Operations (or 
OP) 1 to 6, from the Benesse Corporation. The GTEC for STUDENTS was 
developed to assess the English proficiency of high school students. It 
consists of three sections: Reading, Listening, and Writing. The Speaking 
Section of the test was not available during the research stage of this project 
and is therefore not included in this paper. The GTEC for STUDENTS 
uses linguistic materials drawn from everyday situations, and it measures 
reading (total points: 320), listening (total points: 320), and writing (total 
points: 160) abilities on an 800-point absolute scale that is based on the item 
response theory (IRT). On the basis of their GTEC for STUDENTS scores 
(using a scale of 1 to 6), test-takers are divided into six proficiency grades. 

The overall test specifications for the Reading and Listening Sections of 
the GTEC for STUDENTS are as follows: 


Table 1. The Overall Test Specifications of the GTEC for STUDENTS 
Reading 


1. Reading ability is measured from various perspectives. 
Parts A through C measure the candidate’s reading ability through multilateral 
questions aimed at assessing vocabulary and grammar, speed reading ability, and close 
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reading skills. 
2. The speed reading section lays special emphasis on scanning and skimming abilities. 
The speed reading section is designed to test scanning ability (the ability to retrieve 
information in a short period of time) and skimming ability (the ability to grasp the 
outline). 
3. The close reading section consists of three parts that differ in topic and text type. 
The close reading section consists of three English passages. One is fictional and 
two are from the humanities and natural sciences fields. 
The text types are as follows: one fictional, narrative style passage and two expository 
style passages from texts in the humanities and natural sciences fields. 


Listening 
1. The Listening Section is heavily weighted in this practical language test. 

The Listening Section accounts for 20-30 per cent of the total test time, and 
comprises approximately 50 per cent of the total number of test items. 
2. Listening ability is measured comprehensively. 

The Listening Section is designed to test the candidate’s ability not only to 
comprehend accurately but also to offer an immediate response, identify target 
information, and grasp the outline. 

3. The tests are based on tasks that the candidate may perform in real-life situations. 
This practical application of language in actual situations is aimed at giving students 


a sense of communicative purpose to their test tasks. 


2. The Development of the Proficiency Guidelines 
2.1. The Approach Taken to the Development of the Proficiency Guidelines 
When attempting to develop an English proficiency scale based on an 
English proficiency test, the first task that comes to mind is trying to 
describe what language test-takers successfully accomplish in the test. 
However, this is more difficult than it sounds. If the tests are completely 
criterion-referenced or performance-based, it is usually not very difficult to 
describe the test-takers’ performance. However, most proficiency tests are 
neither criterion-referenced nor entirely performance-based. Instead, in 
actuality, they are norm-referenced. Furthermore, the test items generally 
include a somewhat narrow range of topics, situations, and tasks. Due to 
these restrictions, it is difficult to infer how well test-takers will actually 
perform in real-life situations. Given these difficulties, there seem to be 
two fundamental approaches to deduce test-takers’ English language abilities 
based on their test scores: (1) It is necessary to scrutinize the characteristics 
of the test items that test-takers’ answer correctly and describe how tasks 
within these items are related to fundamental English language tasks in the 
real world. (2) It is necessary to relate the test-takers’ scores to their actual 
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experiences in performing English language tasks outside the domain of the 
test. The former approach generates what we refer to as the Proficiency 
Guidelines (PG) in this paper, and the latter approach generates the Can-do 
Statements (CD). 


2.2. Descriptions of the GTEC for STUDENTS Proficiency Guidelines for 

Each Grade 

Due to space constraints, we are unable to provide a detailed report of 
the development processes that were used for the descriptions of the listening 
performance ability in the Listening Section. This section consists of four 
subsections, each of which includes a different test type. First, we 
calculated the item difficulties using the IRT and then placed the test items 
along with their grades in each subsection in their order of difficulty. 
Second, we looked at the actual items in each grade, and third, we generated 
the descriptions of listening performance ability based on the characteristics 
of test-takers’ responses to the questions. 

The statements relating to each grade were formulated to convey how 
well the test-takers performed real-life tasks rather than how well they 
performed tasks in the test situation. Let us consider the example of 
subsection B. A description falling into the Grade 6 level of ability of test 
performance would indicate that the test-taker was able to choose an indirect 
response to a spoken prompt. Based on the assumption that test-takers who 
can choose indirect responses to spoken prompts can also utilize a wide 
range of other responses, the description was formulated as “Can respond 
appropriately to spoken prompts with a wide variety of expressions by 
inferring the speakers’ intentions.” It is worth noting that the item difficulty 
is subject to the influence of several extraneous factors; therefore, it is 
difficult to arrive at common descriptions for each level of ability. It is 
necessary to remember that the descriptions for each level of ability are 
created from merely a handful of test items. 

Table 2 shows the entire framework for the descriptions of test-takers’ 
performance in the Listening Section. 
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Table 2. Listening Level and the Definition of the GTEC for STUDENTS Grade 
Score Range GTEC Grade Meaning of the GTEC Grade 


220 and above 6 Can understand a variety of spoken content that 
employs complex structures. / Can understand the 
flow of long stories and conversations and grasp the 
speakers’ intentions throughout the entire story or 
conversation. / Can respond appropriately to spoken 
prompts with a wide variety of expressions by 
inferring the speakers’ intentions. 

200~219 5 Can understand a variety of spoken content that uses 
fairly complex structures. / Can understand the flow 
of long stories and conversations, and grasp the 
speakers’ intentions. / Can respond appropriately to 
spoken prompts by inferring the speakers’ intentions. 
180~199 4 Can make inferences about parts of a story or 
conversation and form judgments by integrating 
information that is spread out at intervals in the story 
or conversation. / Can respond appropriately to 
spoken prompts by inferring the speakers’ intentions. 
160~179 3 Can understand basic vocabulary, but the 
comprehension of common English idioms is 
insufficient. / Can understand the explicit meaning of 
sentences. / Can begin to accurately understand 
stories and conversations of limited scope. / Can 
respond to spoken prompts with simple and 
straightforward expressions. 

140~159 2 Can understand certain high-level words and phrases 
that comprise a few basic words. / Can understand 
parts of stories and conversations sometimes but 
comprehension is unreliable. / Can respond to 
spoken prompts with straightforward and somewhat 
unvaried expressions. 

139 and below 1 Can understand fragments of longer stories or 
conversations that employ basic words. / Can 
understand simple spoken content when basic 
structures are used. / Can respond to spoken prompts 
accurately by using expressions that match the 
structure of the prompts. 


The same development processes were followed for the descriptions of 
the reading performance ability in the Reading Section. Table 3 shows the 
entire framework for the descriptions of test-takers’ performance in the 
Reading Section. 
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Table 3. Reading Level and the Definition of the GTEC for STUDENTS Grade 


Score Range 


GTEC Grade 


Meaning of the GTEC Grade 


230 and above 


6 


Can understand the main idea of a text passage and 
find information that is difficult to locate. / Can 
understand upper-level, senior high school 
vocabulary. / Can read an English text accurately and 
at an appropriate speed. 


190~229 


Can judge which sentences include the main idea of 
a longer text passage and find information that is 
difficult to locate quite accurately. / Can understand 
upper-level, senior high school vocabulary. / Can 
read an English text relatively accurately and at an 
appropriate speed. 


160~189 


Can make simple inferences and find information 
that is relatively difficult to locate. / Can understand 
upper-level, senior high school vocabulary. / Cannot 
read an English text passage quickly but can 
comprehend it accurately to a large extent. 


140~159 


Can make simple inferences and comprehend the 
literal meaning of information that is spread out at 
intervals in a text passage. / Can find information 
that is explicitly stated in a text passage. / Can 
understand intermediate-level, senior high school 
vocabulary. / Cannot read an English text quickly but 
can comprehend it accurately to some extent. 


120~139 


Comprehension of a text’s overall meaning and 
ability to locate information is inadequate. / Can 
understand lower-level, senior high school 
vocabulary. / Reading speed is slow, and there are 
frequent misinterpretations of the meaning of text 
passages. 


119 and below 


Can understand junior high school vocabulary. / Can 
comprehend sentences with basic structures and 
meanings. 


3. Development of the Can-do Statements 
The information that can be inferred solely from the test-takers’ 


performance is limited to what the test actually measures. 


Therefore, in 


order to obtain information about test-takers’ abilities to perform in real-life 
situations, it is necessary to find some way of relating the test scores to the 


information about language activities outside the test situation. 


One 


method we used to establish this link was developing score-related Can-do 


Statements. 


Using these statements, test-takers were asked to fill out 


self-assessment questionnaires about their real-world experiences with 
English language activities. 
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3.1. Development and Administration of the Can-do Questionnaire 

The Can-do Questionnaire was designed to explain what the English 
proficiency test score actually imply outside the test situation. When we 
first began developing the questionnaire, we conducted a brainstorming 
session with undergraduate students at the Tokyo University of Foreign 
Studies, who had studied in English-speaking countries for more than a year 
before they entered the university. Some of the questionnaire items were 
generated as a result of this brainstorming session. In addition, we prepared 
additional questions (particularly about real-life situations outside the 
classroom setting), and Professor Takatsuka of Okayama University created 
questions based on classroom activities. After we created the survey 
questions, we formulated alternative responses to each question. One of the 
alternatives indicated that the students had no experience with the English 
language task mentioned in the question, while the other alternatives were 
graded to exhibit varying degrees of experience with the task. Finally, we 
prepared a total of 53 questionnaire items. Using this questionnaire, we can 
relate test-takers’ scores (the GTEC for STUDENTS scores and grades) to 
their actual performance of English language tasks outside the test situation. 

The Can-do Questionnaire, along with the GTEC for STUDENTS, was 
administered to approximately 900 senior high school and university 
students. The questionnaire asked test-takers about three basic types of 
activities that required the use of English. By inquiring about the nature 
and frequency of the test-takers’ personal experiences with specific English 
language activities, this questionnaire provided highly objective data about 
their use of English even though it adopted a self-assessment format. The 
questionnaire items were divided into the following three categories: 


Category A: Items about situations and activities inside the school or classroom 

22 Items 
Category B: Items about situations and activities in daily life outside school 

19 Items 
Category C: Items about situations and activities in English-speaking countries 

12 Items 


Table 4. An Example of a Category of Questionnaire Items 


Category B: (Question 37) Writing postcards and greeting cards in English 
1. Ihave never written cards or postcards in English. 

2. Ican write simple messages in English. 

3. Ican express myself to some degree using written English. 

4 


I can write most of what I want to express using English. 
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3.2. Results and Interpretation of the Can-do Questionnaire 

The students who selected the non-experience alternatives were 
excluded from the analysis of the data for each questionnaire item. After 
this, we calculated the total number of students falling into each GTEC for 
STUDENTS grade who selected either of the remaining alternatives. The 
data for each questionnaire item were displayed in the form of a bar chart 
(see below). If the percentage of students who selected an alternative 
indicating some experience with a specific English language activity 
(including responses indicating lower degrees of experience) exceeded 50%, 
the data was interpreted to mean that those students had attained “mastery” 
over that activity. An example of one such bar chart is shown below. 


No. of Questions = 1-6 


Grade 1 (19/31) 
61.3% 66.7 33.3 


Grade 2 (59/100) 
59% 


53.7 43.9 2.4 
Answer 2 


Grade 3 (43/85) Answer 3 
50.6% a“ 40.5 9.5 | |p Answer4 


Answer 5 


Grade 4 (34/78) El Answer 6 
43.6% pe es 


Grade 5 (15/77) ' 
19.5% 29 66.1 21.0 


Grade 6 (8/74) 
108% 59.1 31.8 94 


0% 20% 40% 60% 80% 100% 


Figure 1. An Example of a Bar Chart for the Can-do Questionnaire about the Ability to 
Understand News in English on the TV and Radio 


Based on the results displayed in the bar charts, we created a list of CD 
(see below), which—along with the test-takers’ test results and the PG that 
was developed for the test—can be used to provide the test-takers with 
detailed feedback. While constructing the Can-do list, we adhered to the 
following principles: 


1. Develop CD that will most effectively capture the profiles of 
test-takers from each GTEC for STUDENTS grade. 

2. Provide approximately the same number of descriptions for each 
ability level, and the same amount of information for both the 
test-takers with lower grades and those with higher grades. 

3. Ensure that the statements are as encouraging as possible for 
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test-takers by using positive rather than negative wording in the 


descriptions. 


This is the same approach by which the CEFR was 


developed in that “all the statements in the (CEFR) scales are 
positive, and are intended to demonstrate that even ‘low’ levels of 
language learning have value and worth” (Heyworth, 2004: 16). 
As an example of the third point, we can consider the last statement 
of the GTEC CD L Grade 2. This statement could have been 
expressed as “I can hardly understand films without subtitles”; 
instead, it has been expressed positively as “I can understand some 
of the lines in films with subtitles.” 


Table 5. The Complete List of Can-do Statements for the GTEC for STUDENTS 
Grade Listening Reading Writing Total 

I can generally Ican understand {I have no I can answer 
understand the the outlines of difficulty in phone calls from a 
outlines of the textbooks and expressing myself | native speaker 
news on the radio | paperbacks in in letters, e-mails, |appropriately. I 
andTV. Ican English. Ican cards, or diaries can understand 

6 understand the understand most of | without using a most of the classes 
main plots of films | the contents of dictionary. without difficulty 
without subtitles, | homepages on the and can participate 
though there are Internet. in activities in an 
some lines I English-speaking 
cannot understand. country. 
Icanunderstand /|Ican mostly I can usually write |I can usually 
most parts of the | understand articles | my daily interact with shop 
NHK radio of personal interest |schedules. Ican | assistants without 
English programs | in Japanese take down notes difficulty. Ican 
for the newspapers in onthe important | understand most of 

5 intermediate level | English, such as points during the classes in an 
without textbooks. | Japan Times and __| lessons in an English-speaking 
Icanunderstand | Daily Yomiuri, English-speaking | country and 
the main points of | without using a country. participate in 
the news in dictionary. activities. 
English on 
familiar topics. 
Icanunderstand |Ican understand |I can write most of |I can give a 
some parts of the | English books the things I want | five-minute 
NHK radio English | with notes in to express in self-introductory 
programs forthe |Japanese. Ican_ |letters, e-mails, or | speech using 

4 intermediate level | understand most of | cards without notes. I can 


without textbooks. 
I can understand 
some automated 
telephone response 
messages. 


the contents of 
homepages in 
English on the 
Internet. 


using a dictionary. 


guess the topic 
when I attend a 
class in an 
English-speaking 
country. 
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Icanunderstand |Ican understand |I can write whatI |I can interact with 
most parts of the | articles of personal | want to express in | shop clerks 
NHK radio interest in a diary by without difficulty. 
English programs | Japanese consulting with a |I can offer some 
for beginners newspapers in dictionary. Ican |simple 
3 without textbooks. | English, such as take down notes explanations when 
Icanunderstand |Japan Times and _ |on the main points |I am approached 
some of the lines | Daily Yomiuri, of lessons in an in English by a 
in films without with the aidofa |English-speaking | native speaker. 
reading the dictionary. country. 
subtitles. 
I can recognize the |I can understand —_| I can manage to I can understand 
topics of the news | articles of personal | write what I want | what an English 
on the radio and interest in English | to express in cards, | native speaker is 
TV. Ican magazines or letters, and e-mails | saying on the 
understand some | newspapers—such | by consulting with | phone, though I 
of the lines in as Student a dictionary. cannot interact 
2 films with the aid | Times—with notes appropriately. I 
of subtitles. in Japanese, can manage to get 
without using a my work done at a 
dictionary. bank or a post 
office in an 
English-speaking 
country. 
Icanunderstand /|Icanunderstand |I can express I can give details 
telephone simple English myself insimple | about my family 
numbers, prices at | messages, such as |English in letters background, and I 
shops, or those in birthday |andcards. Ican |can calla friend on 
Upper 1 | directions to local | or Christmas maintain adiary _| the telephone. 
places if spoken to |cards. I canread |using simple 
in simple, slow most of the menus | English. 
English. in fast-food 
restaurants. 
I can usually I can read the I can write simple |I can greet people 
understand the names of friends | messages in at any time of the 
time, day of the in Roman letters | greeting cards and | day, or say good 
week, or month. and understand postcards. Ican |bye. I can givea 
I can also English signs like |sometimes write a |self-introduction 
Lower 1 : . : bigs 
understand most of | “Exit,” few lines in my consisting of my 
the simple “Entrance,” diary by name, age, school, 
questions suchas | “Stop,” and so on. | consulting a class, date of birth, 
“Where do you dictionary. and hometown. 
live?” 
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4. The Applicability of the CEFR to an IRT-based English Proficiency 
Test in Japan 
The Council of Europe has recently produced a preliminary pilot 
Manual (Council of Europe, 2003) that presents a procedure for linking 
examinations to the CEFR (see also Figueras et al., 2005). The procedure 
includes the following four stages: 


Familiarization: This involves a selection of activities designed to ensure that 
participants in the linking process have a detailed knowledge of the CEFR. 


Specification: This is a self-audit of the coverage of the examination (content and task 
types), profiled in relation to the categories presented in the CEFR, Chapter 4 
“Language use and the language learner’ and the CEFR Chapter 5 “The user/learner’s 
competences.” The outcome of the specification is a report of how well the 
examination content reflects the descriptive categories of the CEFR. 


Standardization: These are suggested procedures to facilitate the implementation of a 
common understanding of the “Common Reference Levels” presented in the CEFR, 
Chapter 3. Reading and listening items and speaking and writing benchmarks chosen 
by an international panel will be provided to assist training in the standardization of 
judgments. 


Empirical validation: This involves the collection and analysis of test data and ratings 
from assessments to provide evidence that both the examination itself and the linking 
to the CEFR are sound. 


In this paper, I would like to focus on the empirical validation by comparing 
two seemingly corresponding descriptors regarding listening and reading. 
As a matter of course, since the two descriptors under consideration were 
created independently, they cannot be identical; however, they appear to 
share essentially the same language performance or characteristics of 
linguistic knowledge. (The corresponding descriptors are italicized in the 
following tables.) 


Table 6. Correspondence between the GTEC PG L and the CEFR L 
GTEC PG L-6: 
Can understand the flow of long stories and conversations and grasp the speakers’ 


intentions throughout the entire or conversation. 
Corresponding CEFR L-C1 descriptor: 
I can understand extended speech even when it is not clearly structured and when 


relationships are only implied and not signalled explicitly. I can understand television 
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programmes and films without too much effort. 


GTEC PG L-5: 

Can understand the flow of long stories and conversations and grasp the speakers’ 
intentions. 

Corresponding CEFR L-B2 descriptor: 

I can understand extended speech and lectures and follow even complex lines of argument 
provided the topic is reasonably familiar. 


GTEC PG L-4 
No corresponding CEFR L descriptor 


GTEC PG L-3: 

Can understand the explicit meaning of sentences. 

GTEC PG L-2: 

Can understand certain high-level words and phrases that comprise a few basic words. 
Corresponding CEFR L-A2 descriptor: 

I can understand phrases and the highest frequency vocabulary related to areas of most 
immediate personal relevance (e.g., very basic personal, and family information, 
shopping, local area, employment). I can catch the main points in short, clear, simple 
messages and announcements. 


GTEC PG L-1: 

Can understand fragments of longer stories or conversations that employ basic words. 
Corresponding CEFR L-A1 descriptor: 

I can recognise familiar words and very basic phrases conceming myself, my family and 


immediate concrete surroundings when people speak slowly and clearly. 


Table 7. Correspondence between the GTEC PG L and the CEFR OI 


GTEC PG L-2: 

Can understand parts of stories and conversations sometimes, but comprehension is 
unreliable. 

Corresponding CEFR OI (Oral Interaction)-A2 descriptor: 

I can handle very short social exchanges even though I can’t usually understand enough to 
keep the conversation going myself. 


GTEC PG L-2: 

Can respond to spoken prompts with straightforward and somewhat unvaried expressions. 
Corresponding CEFR OI (Oral Interaction)-A2 descriptor: 

I can communicate in simple and routine tasks requiring a simple and direct exchange of 


information on familiar topics and activities. 
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Table 8. Correspondence between the GTEC PG R and the CEFR R 
GTEC PG R-6 

GTEC PG R-5 

GTEC PG R-4 

No corresponding CEFR R descriptors 


GTEC PG R-3: 

Can find information that is explicitly stated in a text passage. 

Corresponding CEFR R-A2 descriptor: 

I can read very short, simple texts. I can find specific, predictable information in simple 
everyday material such as advertisements, prospectuses, menus, and timetables and I can 
understand short simple personal letters. 


GTEC PG R-2: 

Can understand lower-level, senior high school vocabulary. 

GTEC PG R-1: 

Can understand junior high school vocabulary. 

Corresponding CEFR R-A1 descriptor: 

I can understand familiar names, words, and very simple sentences, for example on notices 


and posters or in catalogues. 


Table 9. Correspondence between the GTEC CD L and the CEFR L 
GTEC CD L-6: 
I can generally understand the outlines of the news on the radio and TV. I can understand 


the main plots of films without subtitles, though there are some lines I cannot understand. 
Corresponding CEFR L-C1 descriptor: 
I can understand television programmes and films without too much effort. 


GTEC CD L-5: 

I can understand the main points of the news in English on familiar topics. 
Corresponding CEFR L-B2 descriptor: 

I can understand most TV news and current affairs programmes. 


GTEC CD L-4: 

I can understand some automated telephone response messages. 

Corresponding CERF L-B1 descriptor: 

I can understand the main points of clear, standard speech on familiar matters regularly 
encountered in work, school, leisure, etc. 


GTEC CD L-2 
No Corresponding CEFR L-A2 descriptor 
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GTEC CD L-Upper 1: 

I can understand telephone numbers, prices at shops, or directions to local places if 
spoken to in simple, slow English. 

GTEC CD L-Lower 1: 

I can usually understand the time, day of the week, or month; I can also understand most of 
the simple questions such as “Where do you live?” 

Corresponding CEFR L-A1 descriptor: 

I can recognise familiar words and very basic phrases concerning myself, my family and 
immediate concrete surroundings when people speak slowly and clearly. 


Table 10. Correspondence between the GTEC CD R and the CEFR R 


GTEC CD R-6: 

I can understand the outlines of textbooks and paperbacks in English. 

Corresponding CEFR R-C1 descriptor: 

I can understand long and complex factual and literary texts, appreciating distinctions of 
style. 


GTEC CD R-5: 

I can mostly understand articles of personal interest in Japanese newspapers in English, 
such as Japan Times and Daily Yomiuri, without using a dictionary. 

Corresponding CEFR B2 descriptor: 

I can read articles and reports concerned with contemporary problems in which the 
writers adopt particular attitudes or viewpoints. 


GTEC CD R-4 
GTEC CD R-3 
GTEC CD R-2 
No Corresponding CEFR R descriptors 


GTEC CD R-Upper 1: 

I can understand simple English messages, such as those in birthday or Christmas cards. 
I can read most of the menus in fast-food restaurants. 

Corresponding CEFR R-A2 descriptor: 

I can read very short, simple texts. I can find specific, predictable information in simple 
everyday material such as advertisements, prospectuses, menus and timetables and I can 
understand short simple personal letters. 


GTEC CD R-Lower 1: 

I can read the names of friends in Roman letters and understand English signs like “Exit,” 
“Entrance,” “Stop,” and so on. 

Corresponding CEFR R-A1 descriptor: 

I can understand familiar names, words, and very simple sentences on notices and posters 
or in catalogues. 
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The GTEC for STUDENTS grades seem to be related to the CEFR 
levels in the following way: 


Table 11. Relationships between the GTEC PG and CD Grades and the CEFR Levels 


CEFR L Al A2 Bl B2 C1 C2 
GTEC PGL 1 2,3 — 5 6 — 
GTEC CD L L1, U1 — 4 5 6 — 
CEFR OI Al A2 Bl B2 C1 C2 
GTEC PGL — 2 — 5 6 — 
CEFR R Al A2 Bl B2 C1 C2 
GTEC PGR 1,2 3 = — — — 
GTEC CDR L1 U1 — 5 6 — 


The orders of the GTEC grades correspond to the CEFR levels, 
respectively, although some CEFR levels lack corresponding GTEC grades. 
For example, there is no corresponding GTEC PG L grade for the CEFR L 
B1. However, the CEFR L B1 is located between the GTEC PG L 2, 3 and 
the GTEC PG L 5; therefore, it appears to correspond to the GTEC PG L 4. 
From the above correspondence, the CEFR listening levels and the CEFR 
reading levels can be linked to the GTEC listening grades and the GTEC 
reading grades, respectively, in the following manner. 


Table 12. Relationships between the GTEC Grades and CEFR Levels 


Listening 
CEFR Al Al A2 Bl Bl B2 C1 
GTEC Lower 1 | Upper 1 2 3 4 5 6 
Reading 
CEFR Al A2 Bl Bl B2 C1 
GTEC Lower 1 | Upper 1 2 3 4 5 6 


The GTEC for STUDENTS seems to measure narrower ranges of abilities 
than the CEFR; the former test also assumes smaller steps toward the lower 
end. 

On linking the CEFR with the GTEC CD/PG, several characteristics of 
each framework emerge. The differences between the CEFR and the GTEC 
CD/PG can be summarized as follows: 
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(1) 


(2) 


(3) 


(4) 


(5) 


The CEFR L makes no reference to sentence structures. In relation to this, 
Keddle (2004: 46) states that “the self-assessment descriptors, quite deliberately, 
do not refer to grammar or structures” and that “they speak about how you 
communicate, and how well you understand text and speech—the grammar used 
is a part of your performance of the task.” This is because the CEFR is not 
designed for a particular language. Thus, not only is it inappropriate but it is 
also impossible to describe language ability in terms of grammar or structure. 
This is also the case with the GTEC CD but not with the GTEC PG since some 
of the descriptors in the latter are language-based. 

It is evident that no reference is made to specific local English radio programs 
and specific local English newspapers in the CEFR CD L and the CEFR CD R, 
respectively. This is because the CEFR is a common framework for people from 
different parts of Europe. On the other hand, the GTEC CD targets Japanese 
senior high school students; therefore, it can only describe students’ 
performance in a very specific context, such as NHK’s radio English programs 
and the Japan Times. 

There is no reference to dialect in the GTEC CD/PG L. Proficiency tests such 
as the GTEC tend to adopt so-called “standard pronunciation,” and therefore, 
speakers with strong accents are likely to be excluded from the recording 
process; the CEFR L descriptors, on the other hand, occasionally refer to 
standard or nonstandard language. Such examples include “I can understand 
announcements and messages on concrete and abstract topics spoken in 
standard language at normal speed (B2),” “I can understand a wide range of 
recorded audio material, including some nonstandard language, and identify 
finer points of detail, including implicit attitudes and relationships between 
speakers (C1),” etc. (Italicizing is mine.) 

There is no reference to the speed and clarity of delivery in the GTEC CD/PG L, 
whereas the CEFR L includes descriptors such as “I can understand everyday 
expressions dealing with simple and concrete everyday needs, in clear, slow, and 
repeated speech (A1),” “I can understand the information content of the 
majority of recorded or broadcast audio material about familiar subjects spoken 
relatively slowly and clearly (B1),” and “I can understand announcements and 
messages on concrete and abstract topics spoken in standard language at normal 
speed (B2).” (Italicizing is mine.) Although the speed and clarity of delivery is 
expected to affect the level of understanding significantly, these factors tend to 
be kept constant throughout the test administration. It seems as though there 
are very few English proficiency tests that include varying degrees of clarity and 
speeds of delivery. 

The CEFR R makes no reference to reading speed. Reading speed is one of the 
important characteristics of L2 reading ability. This seems asymmetrical in 
that the CEFR L mentions speed of delivery. On the other hand, there is a 


Presented by: https://jafrilibrary.com 


Presented by: https://jafrilibrary.com 


How Much Do We Have in Common 99 


consistent reference to reading speed in the GTEC PG R from Grades 2 through 
6. 

(6) There is little reference to scanning in the CEFR R apart from the A2 descriptor 
“T can find specific, predictable information in simple everyday material such as 
advertisements, prospectuses, menus, and timetables.” It should be noted, 
however, that DIALANG, which is a suite of computer-based diagnostic tests 
based on the Common European Framework (see Alderson and Huhta, 2005), 
includes a number of scanning-related descriptors. 

(7) Little reference is made to the quality of reading comprehension in the CEFR R, 
and therefore, expressions such as “literal meaning” and “inference” cannot be 
found in the CEFR R. 

(8) The CEFR R has no instances of reference to a dictionary. The use of a 
dictionary while reading a newspaper may well have an impact on the outcome 
of the reading; therefore, it might be a good idea to include such a situation in 
assessing reading. 

(9) As Keddle (2004: 50) points out, the tenor of competences described in the 
CEFR is quite adult and therefore seems “irrelevant to classroom achievements.” 
On the other hand, the GTEC CD as well as the PG appears to be suited to the 
achievements and interests of a teenage learner in the classroom. 


5. Conclusion 

The results that were arrived at in the development of the GTEC CD 
and PG are summarized as follows. First, it is possible to construct a useful 
framework of performance descriptions based on information pertaining to 
item characteristics in English proficiency tests. Second, the ability to 
perform most real-life tasks is closely related to the test grades; however, it 
must also be stated that the ability to perform some types of tasks has little or 
no relation to test grades. Third, by using the Can-do list, we can provide 
test-takers with predictive statements about their ability to perform English 
language tasks outside the test situation. 

The GTEC CD and PG as well as the CEFR CD were developed 
independently and in completely different contexts. However, it was 
demonstrated that there is a close correspondence between the GTEC CD 
and PG and the CEFR CD. This seems to indicate that these three have a 
common foundation for language learning development and reinforce each 
other’s validity. Finally, it is worth noting that the specific requirements 
and contexts of each framework have resulted in different perspectives and 
emphasis on various aspects of language ability. 
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An Inquiry into the Social Aspects 
of Language Teacher Expertise 


Alison STEWART 


Every profession has its experts, that is, people whose knowledge of a 
subject or whose performance of a skill far exceeds that of other members of 
the profession. Teaching is no exception, and there has been increasing 
interest by researchers over the past two decades in understanding what 
distinguishes expert teachers from non-experts, and the process by which 
they come to acquire and maintain their expertise. However, whereas 
expertise was initially regarded as a purely psychological phenomenon, more 
recent thinking (e.g. Clandinin & Connelly, 1995; Connelly & Clandinin, 
1999; Tsui, 2003) suggests that it needs also to be understood from a social 
perspective. In other words, in order to understand expertise, it is necessary 
to examine both the expert practitioner and the social context in which the 
expert’s skill is acknowledged. 

This paper attempts to show why a socially oriented understanding of 
expertise is of particular relevance to the situation of language teachers. 
Drawing on a narrative study of two native English-speaking teachers 
working in Japanese universities, I will propose that expertise does not solely 
depend on individual knowledge, skill or performance, but, just as 
importantly, on a teacher’s capacity to negotiate with others in their work 
communities and to have their voices heard. 


1. Defining Expertise: moving to a social perspective 

Literature in teacher education and teacher knowledge has traditionally 
tended to focus on expertise from a behavioural or psychological perspective 
(e.g. Atkinson & Claxton, 2000; Bereiter & Scardamalia, 1993; Eraut, 2000; 
Schon, 1983). Such approaches attend to distinctions between experts and 
non-experts (Tsui, 2003; Woods, 1996), analyse what experts do when they 
perform (Claxton, 2000; Eraut, 2000; Schon, 1983; 1991) and determine the 
attitude they have towards the activity in which they excel (Bereiter & 
Scardamalia, 1993). Many of these studies concentrate largely on cognitive 
processes of individual practitioners, giving the impression that their 
thinking and behaviour occur largely independently of social contexts. 

However, an increasing influence on this field has come from 
ethnographic case studies of teachers’ lives and work, which show that the 
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knowledge and skills that teachers develop are closely bound up with the 
specific contexts in which they work and with their personal histories of 
experience (Clandinin & Connelly, 1995). Following this approach in her 
case studies of four language teachers in a Hong Kong secondary school, 
Tsui (2003) elaborates that, in addition to being able to integrate knowledge 
of the curriculum or of wider social matters into their daily teaching practice, 
expert teachers relate to specific contexts and ‘situated possibilities’. 
Moreover, experts maintain a particular attitude or disposition toward critical 
self-reflection, or as Bereiter and Scardamalia (1993) term it, ‘progressive 
problem solving’. Finally, she argues that expertise is ‘multiple’ and 
‘distributed’, that is, it exists within social groups or communities, and 
individuals can draw on and benefit from knowledge and practices of expert 
members of their community of practice. 

Despite Tsui’s assertion that expert knowledge is multiple and 
distributed, this in itself is not a social interpretation of expertise. Rather than 
focusing on what individual practitioners do, a social account of expertise 
needs to look at the context of the practice: How do others respond to what 
they do? How do they relate to or fit in with their community of practice? In 
other words, these questions are related to social identity or position. 
Rampton (1990; 1995) provides a list of characteristics of expertise that do 
derive from this kind of socially oriented perspective: 


1) Expertise is learned, not fixed or innate. 

2) Although they often do, experts do not have to feel close to what 
they know a lot about. Expertise is different from identification. 

3) Expertise is relative. One person’s expert is another person’s fool. 

4) Expertise is partial. People can be expert in several fields, but they 
are never omniscient. 

5) To achieve expertise, one goes through processes of certification, in 
which one is judged by other people. Their standards of assessment 
can be reviewed and disputed. There is also a healthy tradition of 
challenging ‘experts’ (1990, p.98-99). 


In addition to the teachers’ own definition of expertise, Rampton’s five 
points provide a framework of research questions: 


1) How have they acquired their expertise? Was it learned or innate? 

2) What areas of practice are valued most highly by the teachers? How 
closely do they identify with their expertise? 

3) How does their expertise relate to practices of other teachers at work? 

4) What limitations do teachers ascribe to their expertise? Where do 
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they see the boundaries of what they know? 
5) How is their work judged by others? What evidence is there of a 
‘healthy tradition’ of challenging experts? 


2. Methodology 

From November 2001 to March 2002, as part of my research into 
teacher identity, I conducted a series of interviews with native and non-native 
teachers of English working at Japanese universities. Following a research 
methodology suggested by Seidman (1998), I conducted three long, 
unstructured interviews lasting around 90 minutes with each of the teachers. 
In the first interview, the teachers were asked to talk about their academic 
and teaching careers, in the second, about their everyday teaching practices, 
and in the third, about issues and beliefs that affected their work. This paper 
draws on the interview data provided by two of the teachers. Although it 
draws on the transcripts of all three interviews, most of the examples in this 
paper are taken from the second interview in which they described their 
teaching practices. 

The interviews were exploratory, with the aim of eliciting a wide range 
of experiences and opinions relating to the teachers’ subjective view of their 
place within a social context. Questions and prompts were kept to a 
minimum in order to enable the interviewees to extemporize on the subject 
of the interview at their own pace, and in their own words. This follows a 
phenomenological approach which prioritizes the subject’s experience of the 
social world (Schutz, 1967). Transcripts of the interviews are treated as data 
from which conclusions can be drawn regarding the above research 
questions. 


3. The teachers 

This paper focuses on two teachers, who occupy similar positions in 
their universities and whose level of academic qualification is quite similar, 
but whose sense of how their expertise is judged within their work 
communities is significantly different. Both are native speakers of English 
with over forty years of teaching experience between them. Jack is a 
Canadian man in his fifties who taught in Canada, Germany and Poland 
before moving to Japan where he has worked for over ten years. Dave is a 
British man in his forties, whose teaching career has been largely conducted 
in Japan’. 

The teachers in the study both occupied the same institutional position 


' Jack and Dave are both pseudonyms. In addition, other identifying features have been 
changed to protect the anonymity of the teachers. 
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of foreign lecturer (gaikokujin kyakuin kyoshi)’, albeit at different national 
universities. Universities in Japan appoint teachers on the basis of some kind 
of expertise, as attested to by academic qualifications, experience teaching at 
the same level, and/or by recommendation of a known and trusted authority. 
Both Jack and Dave were appointed on the basis of masters degrees and 
several years experience teaching English as a Second or Foreign Language. 
In addition, both were recommended to the university by ‘insiders’. 
Subsequently, both had worked in the same position for some years - Jack 
more than ten, and Dave, seven - time enough to feel established and 
confident in their practice. 

Taking a social perspective means adopting a relativistic, or at least 
open-minded, view of expertise. Unlike Tsui’s study (2003), I have not 
ranked the teachers on a scale of expertise; but neither have I avoided the 
term altogether, as do Connelly and Clandinin (1999). Instead, in this paper, I 
treat both Jack and Dave as experts, without seeking to view one as more 
expert than the other, or to deny the claims that they make about their own 
expertise. However, it will be clear to the reader that, expertise apart, Jack 
appears to be frustrated and unsatisfied in his job teaching at a national 
university, whereas Dave is relatively happy and fulfilled in his job at a 
similar institution. 


4. Teachers’ definitions of expertise 

Although the teachers were not asked directly to give a definition of 
expertise, they were asked to provide examples of people they thought had 
influenced them as teachers. In addition, their interviews are full of value 
judgments of the kinds of teachers and teaching practice that they consider to 
be good or bad, and hence from which definitions of expertise can be 
extrapolated. 

Jack describes the most important influence on his teaching as the 
teachers he met while doing a masters degree in TESL in Canada: 


I think one of the biggest influences was in [Vancouver] when I decided to veer away 
from literature and get to something practical which was teaching English as a Second 
Language. That was the TESL Center at A University. They were excellent teachers, 
great people, some of them became my closest friends, they visit me here, we keep in 
contact. They really taught me a lot, they all love teaching English as a Second 
Language, they were all really great scholars and involved in community work and 


* This is a post that has been phased out since the 1990s by the Ministry of Education, 
Sport and Culture (Monbusho), and has been replaced with a similar position (gaikokujim 
kyakuin kyoin), which retains the marginal status of the previous post but loses the 
preferential financial terms of employment enjoyed under the latter contract. 
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helping people, whatever. I think the TESL Center at A University was really 
important. 


The penultimate sentence contains the main qualities that Jack values: a love 
of the practice and theory of teaching as well as a desire to be socially useful. 
This last point is echoed at several points throughout the interviews. 

To contextualise Jack’s comments, before coming to Japan, Jack 
founded and for several years directed an Adult Immigration Center, a period 
of time which he refers to repeatedly as ‘the golden era’ of his teaching 
career. It was during that time that he completed his masters in TESL, and 
continued his association with the university by working as an adjunct 
lecturer on ESL methodology and by inviting ESL students into his 
classrooms to observe a ‘master teacher’ in action. 

What stands out in his accounts of working at the Adult Immigration 
Center is the emphasis on being of practical help and on the 
acknowledgement of this by his students: 


It was the golden age ... because you felt that the language you were teaching, it was 
not like finishing school, it was something that people could use and they needed. I’m 
not trying to say that this was an act of charity, you know, some kind of colonial 
attitude where you’re helping the poor, there, you’re the white .. you know, native 
speaker helping. It wasn’t that at all, it was that you were really helping people [...] In 
many cases you were helping people that had had horrific experiences: swimming 
across alligator-infested rivers in Cambodia, you know. And they really, really needed 
the language and they wanted to help themselves. And they really appreciated you a 
lot. And I had complete free rein to develop the program as I liked. With cultural 
excursions, materials, textbooks, really anything we wanted. We had to stay kind of 
within the guidelines of what the government, you had to teach them a little bit of 
math every day, like 5 minutes. And you had to have some books for literacy type 
teaching reading. But basically you could work around that. So as long as they were 
getting the language. The teaching staff was fantastic. That was another thing that 
made it a golden age, you know.... 


For Jack, then, expertise entails not only knowledge and know-how, but 
crucially, being in a situation in which students have a real need which he is 
in a position to address. In this case, Jack was not limited by institutional or 
administrative constraints (‘I had complete free rein to develop the 
program’), nor did he lack the support or cooperation from the other teachers 
(‘the teaching staff was fantastic). Moreover, the success of Jack’s expertise 
is proven by the reaction from students (‘they really appreciated you a lot’). 
As he mentions elsewhere, good teaching is ‘a two-way street’, where 
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students learn something of value from the students. Jack’s portrayal of the 
Adult Immigration Center offers a yardstick of ‘good teaching’ by which his 
other experiences, and particular his current experiences of teaching in Japan, 
fall short. 

The concept of expertise that is expressed in Dave’s comments similarly 
entails more than knowledge and know-how. Dave has a particular interest in 
teacher development, and at the time of the interviews was nearing 
completion of a doctorate in education on the subject. Accordingly, he had a 
lot to say about the development of his own expertise as a language teacher. 
As in Jack’s case, the role models that Dave cites are significant more for a 
particular attitude they convey towards teaching and students, than for the 
details of the practice itself. 

Dave first mentions a negative influence, a French teacher from the 
time when he was a high school student: 


.... this is one of the few teachers who I do model myself on what not to do. It was the 
French teacher who used to take the piss out of me and be humiliating and made me 
not want to speak French at all. 


This is contrasted with one of the teachers he met at International House, 
where he first trained as a teacher and obtained his RSA certificate: 


She became a really good friend, and she was a really fantastic teacher. So there’s 
another role model. I mean she was so .. well. What she had was an affinity for 
foreigners and she’d be really helpful and she’d smile and talk to them nicely. She had 
a calm manner, she’s not one of those who comes into a room in a really gung ho 
manner, tells loads of jokes and everybody loves immediately, but she had a very calm 
manner and people would trust her immediately. Just a wonderful character to be with. 


Although Dave does not claim to be similar in character to this teacher (he 
admits ‘I am quite a sarcastic person and I do like to make jokes with the 
students’), it is clear that good teaching involves the ability to create a 
‘comfortable’ atmosphere in the classroom, where students can develop their 
skills in a non-threatening environment. 

Like Jack, Dave also talks about his impact on students, but it is more in 
terms of the measurable improvement he can detect in particular skills. For 
example, he describes how when he started teaching at the university, he had 
no experience in teaching writing. However, over seven years he has built up 
an expertise of which he feels proud: 


I kind of tried hard to find out about it, and thought about it a lot. I realise it’s an 
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enormous area and I’m... just dabbling with it. But I think I’ve got some good systems 
now. And I can see how it works and can see some improvement in the students. I 
collect everything that they write. This is a year’s worth of stuff. And if you look 
through here you can see the improvement of the students.... 


Dave’s students show their appreciation by selecting his class in preference 
to that of his colleague, who, he notes with annoyance, is left with only a 
handful of students, compared with Dave’s classes of forty and more. 

While Dave appears to be more satisfied and perhaps more successful 
than Jack in terms of his teaching practice with students, he is less sure of his 
standing in a professional community. Whereas Jack refers back to the 
golden age when he was accepted by the academic community at the 
university where he completed his masters degree, Dave was at the time of 
the interview applying for jobs in teacher education in the UK, that is, was 
seeking but had not yet gained acceptance in the language teaching 
profession at a higher level. Whereas he feels that he is fit for such a position 
in terms of his commitment to developing his teaching practices and in terms 
of the expertise he now has, he is apprehensive that this may not be sufficient 
to secure him a job at that level: 


This morning I’ve written up my CV and I’m realizing I should have done other things. 
When you do things there’s a part of you that says oh that’d be good for the CV, but I 
must admit that’s not been prominent for me. It’s been oh that would be an interesting 
experience, and I can gain that experience and it might be useful to improve me as a 
teacher educator or researcher. But just recently like paper qualifications are important, 
yeah, because that’s how you get judged. 


As Dave’s comments show, experience and skills are in themselves not 
sufficient criteria for determining expertise. Although he has worked over the 
years to improve his knowledge and skills for the sake of his current job and 
for the sake of his image of the kind of teacher he would like to be, as job 
seeker, he is now aware that his definition of an expert teacher/teacher 
educator/researcher is not necessarily the same as that of prospective 
employers. 


5. Expertise: acquired, fixed or innate 

Both Jack and Dave have decades of experience in teaching, and both 
have stories to relate about the process by which they acquired their teaching 
expertise. As we have seen in the previous section, Dave has developed 
much of his expertise on and through the job. With writing instruction, 
because he had no previous experience, he attended a writing instruction 
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workshop, where he met a group of teachers who continued to meet regularly 
to share and plan research on problems in teaching EFL writing. Dave’s 
action research on his own students has provided him with the ‘systems’ that 
he values at present, but he has also benefited from the collaboration with 
other teachers in analogous working situations at other universities. He also 
relates how he has initiated other support groups of teachers and 
postgraduate students at his university, as a way of raising and dealing with 
common problems. Collaboration with others in order to continually improve 
practice (Edge, 1992; 2001) is a guiding principle in Dave’s attitude towards 
his work generally. 

Jack, by contrast, does not say that he engages in this kind of 
self-development at his university job, although he does keep abreast of 
developments in the teaching profession through conferences and 
professional literature, and pursues training and development through an 
American university for which he teaches part-time. Unlike Dave, Jack 
suffers from what he perceives to be a xenophobic attitude towards foreign 
teachers. Although he believes that his expertise is at least the equal of his 
Japanese colleagues, his value in the eyes of the Japanese university is 
limited to his identity as a native speaker. Foreign EFL teachers are seen as 
‘walking dictionaries’, useful only insofar as they are native speakers of a 
language. Apart from their usefulness as native speakers, foreign teachers are 
regarded as less authoritative than their Japanese counterparts: 


There’s also an attitude that English, at least the classes we teach, that they’re not 
serious, because as you know there’s kind of a division between what the experts teach, 
and what the native speakers teach. 


However, while he resents the fact that he is positioned in this way, Jack 
does in fact sometimes emphasise his linguistic identity as paramount in his 
role in the classroom. For example, discussing his teaching of writing, he 
argues: 


... Some people would say, well, you’re forcing this kind of Western type of discourse, 
organization, and that’s not .. students should be writing their own cultural style and 
they should be writing haiku in English, and they should be writing ki-shou-ten-ketsu. 
Well, if that’s the case, what’s my role as a foreign lecturer. I see my role here as 
exposing students to different cultural set-ups, in this case different writing styles. 


For Jack, there is an awkward contradiction between his sense that his 


identity as a foreigner is what makes him valuable to his students, and his 
resentment that this is so. 
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6. Expertise and affiliation 

According to Rampton (1990), expertise is not something we 
necessarily feel close to. Although this is open to interpretation, I take it to 
mean an emotional attachment or identification with the subject or skill that 
has been mastered to an expert degree. If the subject or skill is taken to be 
the English language, then as native speakers, Jack and Dave have nothing to 
say on the subject, since they have no alternative to which they might feel 
closer. A love of their language or culture may be evident in some of their 
teaching practices, however. Jack, for example, has always brought in 
cultural realia, such as food, for his students to experience, and likes to use 
teaching materials that include some Canadian cultural content. Dave, too, 
relishes introducing the students on the British Studies course he teaches to 
aspects of British culture, such as cricket (which he plays in the classroom) 
and the pub (to which he organizes an outing). However, unlike Jack, he 
expresses misgivings about the usefulness and the accessibility of this kind 
of course content to the students. 

Perhaps quite naturally, neither Jack nor Dave expresses any negative 
feelings towards their native language or culture. Jack makes a number of 
comments, however, about an institutionalized negativity towards English 
and English culture in Japanese universities. At one university where he 
works part-time, he says: 


The faculty there have got this new idea that people can take a double dose of English. 
I like the way, you know, a double dose already tells you about their attitude toward 
English. You take a double dose and finish it in one year and then you never have to 
look at an English book again. 


Similarly, at his own university, he criticizes a course video and textbook that 
his department had designed for first and second year students: 


It was a trivialization of culture, of if culture was there it was negatively seen. It was 
either slavery or discrimination or exploitation or the culture was seen as trivial, as 
pop culture. 


The result of this institutional negativity towards English is to induce in 
students what Jack calls ‘negative motivation’, and Jack finds this very 
difficult to overcome. On a number of occasions he alludes to a feeling of 
exhaustion, which has led him to invest less time and effort in his classes and 
expect less of his students in return. 
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7. Expertise in work communities 

As has been touched on in the teachers’ definitions of expertise, the 
social experience of these two teachers is very different. Whereas Jack feels 
that he is undervalued by the university where he works, Dave has good 
relations with colleagues and staff, and feels himself to be an integrated 
member of the work community. 

In talking about their narratives about work, both teachers create 
‘us-them’ distinctions’ in describing the social organization of their university 
communities. Jack, for example, sees his department as divided into groups: 


..one is the old philological, the old boys school .. where people will have freshmen 
translate Wittgenstein from English into Japanese or read Beowulf. It’s very 
philologically based. The other, which is very small, basically the young Turks, these 
are people who had spent an eight week-course at Reading University and become 
applied linguists, hem, hem. 


Needless to say, Jack does not imagine himself as belonging to either of 
these groups. Nor does he see himself as a typical foreign teacher, 
conducting conversation classes and giving the students an ‘easy ride’. 

Jack sets himself apart from other teachers at the university as someone 
who is professional and serious about teaching students important academic 
skills (EAP). Criticising the lack of professionalism of his colleagues, he 
states: 


... ’m the only person here who has a sign out during the semester that says office 
hours, and the students know that I have office hours. The students get a syllabus at 
the beginning of a course, and that is not standard practice. 


Jack finds little support in this community. One lengthy anecdote that he 
relates epitomizes for him the bad teaching environment at his university: 


Last year, a student came to me and said ... well actually first I got a letter from the 
ombudsman that works between the students and the department that said Miss 
So-and-so says that she got the wrong grade. [...] Obviously the student wanted to go 
to law school. She was missing some points and she wanted her points up and she was 
going round asking other professors and asking them, ‘could you lift the grade up?’ 
and I said ‘No, I can’t change the grade; that’s what she got’. It was too late, and she 


3 These kinds of social descriptions are referred to as ‘membership categories’ in social 
psychology literature (Sacks, 1974; Tajfel, 1978; Oakes, Haslam and Turner, 1994), and 
as ‘rhetorical redescription’ by Harre and van Langenhove, 1999. 
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got the exam papers back. And she said, ‘Well I only got one exam paper back’. And I 
said, ‘Well if that’s the case, you know, how would you know what your grade was? 
You wouldn’t have known you got such-and-such on the exam’. Anyway, in other 
words everybody knew. In fact the boss knew that there was something but she 
threatened to sue the university. And this went all the way up to the dean. But I just 
refused to do it. Now probably Bob and other people who’ ve been here longer than me, 
they said, ‘Why didn’t you just sign it and get her out? You would have avoided 
problems’. Now my boss doesn’t speak to me. The chairman - of course, he’ll be out 
next year so that’s OK - but it was this lack of support ... I had the feeling that ‘Oh, 
the foreigner botched up, and we’re going to support the student’. It was, I almost had 
a nervous breakdown. It wasn’t because I felt that I might have made a mistake. It was 
because the support was not there when the crunch came. 


The disputed grade incident and the institution’s decision to side with the 
student are interpreted by Jack as undermining his expertise and authority. 
The effect of the story is to establish a moral victory for Jack, but at the 
expense of his sense of integration within the work community. It is other 
people — the dean, the chairman of his department, and even his foreign 
colleague, on whom he has in other circumstances relied for support — who 
are shown as acting unreasonably and unethically. Jack feels himself to be 
marginalized in his work community but at the same time feels that he is 
right and they are wrong, rendering it less likely that relations at work will 
improve. This marginalisation may be actual, but in demonizing the 
university in this way, Jack justifies his growing tendency to give less of his 
time and energy to his classes and colleagues, and to expect less of them in 
return. 

Dave, by contrast, enjoys good relations with the majority of his 
colleagues. Good rapport with people he works with is important: 


I really like to be part of a community and I want to discuss things with other people 
and make things better and have a social life at work, you know. I’m not particularly ... 
I don’t have the disposition where I divide work and home, the two shouldn’t meet. I 
think it’s important that work colleagues get on and see each other socially. They don’t 
have to be best mates, but it helps if you do things together out of the work situation. 
More fundamentally, I think there’s lots of things wrong with the way things are run 
here and we need to get together to improve that. And if we don’t have any rapport, if 
we don’t meet, we can’t work collaboratively to improve things for the students. 


Although he appears to have this kind of relationship with many of his 


colleagues, the other foreign lecturer in his department stands out as 
someone with whom he has no such relationship, and who is depicted as 
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someone who is as different from Dave as possible. In the following extract, 
Dave tells the story of a recent incident in which the colleague in question 
behaved in a way that Dave found offensive and embarrassing, and which 
seems to epitomize both the American colleague’s attitude to work, and 
Dave’s position in direct opposition to that attitude. The narrative was 
prompted by the colleague knocking on the door and inquiring whether Dave 
was affected by a water leak emanating from another office. As soon as the 
door was closed, Dave confided that he detested this man: 


There’s a long .. not a long history, but one or two incidents have happened that have 
soured the relationship. Basically, I’m interested in language teaching and he isn’t, and 
he’s academically snobbish and I’m not. And we have completely different views 
about why we’re here. So I don’t want to talk about teaching or anything, which is 
why I would like to talk to him, sorry, I don’t know if he avoids me, but I definitely 
avoid him. I don’t go out of my way to talk to him. [What does he teach then?] He’s 
supposed to be like me, an English teacher. I have, you saw that list of writing students, 
between 40 and 50 students in my class; he has two. Because he has this image that 
he’s in an Oxford college giving tutorials to people about things that he’s interested in, 
like politics and geography and, er, biblical cartography, you know, which most 
students are incredibly interested in. And he’s definitely not a language teacher, 
doesn’t want to have anything to do with language teaching. But we’re supposed to 
share, you know, the burden of teaching language, in his eyes burden. So anyway, he 
makes me sick, really. 


The portrayal of the colleague includes a view of expertise that contrasts 
directly with Dave’s and serves to cast it in sharper relief. Dave’s colleague 
clearly sees himself as an expert, but it is a kind of expertise that Dave 
believes has little relevance for the students. Dave’s descriptions of the 
colleague as ‘academically snobbish’, and as an imagined Oxford don, 
highlight the fact that expertise, particularly as it is manifested in higher 
education, is often a form of elitism, a way of establishing and maintaining 
certain power relations. According to the colleague, teaching language is 
seen as a ‘burden’, of less importance than how he views his main role, 
which is to impart his views as an ‘expert’ in politics, geography and biblical 
cartography. There are, of course, parallels with Jack’s portrayal of the 
Japanese professors in his department. 

Dave goes on to relate two specific incidents, which justify his hostility 
towards the colleague. In the first, he recounts how he and other members of 
the department had tried and failed to reason with the colleague after an 
accusation by a student of sexual harassment. In a second anecdote, Dave 
had been present when the colleague accused the administration staff of 
denying him an entitlement to money for books and materials. In both cases, 
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Dave allies himself staunchly with the Japanese department or admin staff, 
and is embarrassed to be associated with the colleague, by virtue of the fact 
that they are the only two foreign teachers in the department. 

Dave does not always agree with the ideas or practices of his Japanese 
colleagues. In the following passage, he describes what happened when he 
was invited to participate in the students’ presentation of their graduation 
thesis: 


That was a bit of a shock to me because the other teachers were incredibly critical. It 
was their last day at university (laughs) and one student who wrote in English, she was 
the only one who did, because they usually write it in Japanese, she got a real battering 
from these other teachers. I stood up and said, um she should be admired because she 
wrote it in English, and it was actually a pretty good piece of writing and blah, blah, 
blah. And she wasn’t crying up until that point (laughs). But then she started crying, 
you know... She said thank you for protecting me. But that’s an interesting little 
difference between my colleagues and myself. I want to be supportive to them, and 
there’s a definite feeling that, there was no praise, for example, there was no, oh this 
part was really good, this part needed work, and you could do A, B, C. There was no 
praise at all. It was all just a list of criticisms, Not just from one teacher, but the same 
from all of them. 


Dave is aware that there are striking differences between his style and that of 
his colleagues, but he remains willing to participate in joint activities and to 
look for ways of collaborating with them to solve common problems. 


8. Expertise and specialization 

As we saw in the previous section, there are contrasting views in higher 
education on what constitutes best practice in language teaching. Both Jack 
and Dave are critical of teachers who see themselves as specialists in ‘arcane 
subjects’ such as biblical cartography or translating Wittgenstein from 
English to Japanese. 

Specialised expertise, however, may be regarded as more appropriate at 
university level. Holliday (1994), for example, suggests that current 
language teaching methodologies derive from a different educational culture 
or code from that which prevails in higher educational institutions, and that 
problems arise when teachers try to adopt those methodologies without 
changing other aspects of the educational culture. Other researchers of 
teachers in both EFL and ESL situations’, however, report that language 


4 distinguish between English as a Foreign Language, which is taught in countries where 
English is not an official language, and English as a Second Language, which is taught in 
countries where it is. 
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teachers always tend to be marginalized in university departments that are 
dominated by specialist academics (e.g. Johns, 1997; Johnston, 1997). 
Particularly in ESL contexts, language teaching is intended to enable 
students to cope in the subjects which they choose to specialize in. However, 
language learning is thereby seen as merely instrumental in the pursuit of a 
more important purpose, and language teaching is thereby relegated to a 
level below that of other subjects. This attitude towards language teaching in 
higher education is apparent to both teachers. Jack sees it in the distinction 
between ‘experts’ and ‘native speakers’; Dave detects it in his colleague’s 
attitude toward language teaching as a ‘burden’. 

Nevertheless, both teachers do regard what they do as specialized in a 
particular way. Jack makes this explicit when he protests against the 
expectations of language teachers to teach general or conversational English: 


We’re supposed to be teaching conversation and so on. I don’t do that. I teach English 
for Academic Purposes. So basically, my courses are content based. So they learn 
academic skills, listening to lectures, writing for academic purposes, and so on. I tell 
them right at the beginning, if you want conversation, you go to Berlitz or ECC. This 
is English for Academic Purposes; that’s how I teach. But the attitude toward foreign 
lecturers is that we should be doing conversation, you know, what time does the train 
leave Paddington Station? What did Aunt Jenny put in her suitcase? You know, I’m not 
going to teach that. 


Dave too teaches English for Academic Purposes, particularly as regards 
teaching writing; however, he is less rigid in his view of what that entails. 
Like Jack, Dave also has no time for the practice of English in imaginary 
situations, such as ‘what time does the train leave Paddington Station?’ 
However, he is more inclined to let the students determine the content of his 
classes. Whereas Jack brings cultural content into the classes, Dave allows 
students to choose their own topics for writing, and in his oral 
communication classes, topics are chosen from those which he has found 
productive with students in past classes. Dave notes that this is English for 
‘pseudo-academic purposes’. Most of the students will never use English for 
any other academic purpose besides the classes that he teaches. Jack too is 
aware of the contrived nature of EAP in the EFL context, but nevertheless 
feels that an excessively generalized approach to English is too easy, and 
therefore not appropriate for higher education. 


9. Challenges to expertise 


Rampton (1990) notes that expertise is attested to by a process of 
certification, and that it is always open to challenge. Both Jack and Dave 
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have both been through a process of certification, both with postgraduate 
qualifications in English Language Teaching. In both cases, however, they 
feel that their expertise has not been tested, nor is it really valued at the 
universities where they teach. Although they had the requisite qualifications, 
both teachers felt that they were hired to fill posts in a hurry and that their 
appointment was made on the recommendation of others who barely knew 
them, as in Dave’s case, rather than on their expertise as teachers or scholars. 

Jack is scathing about the levels of expertise he perceives in his 
colleagues. He also finds his fellow-teachers to be reluctant to share or 
expose what they do in classrooms. 


The social element is not there ... the social element means that you’re supported by 
colleagues, and colleagues listen to you. Like action research, you can say, now listen, 
I’m having a problem teaching this, and how would you do it, that kind of thing, [yes, 
yes] I would never ask Japanese faculty. In fact the foreign full-time faculty said they 
would, but then they fudged on it. They didn’t want to do it. [What was that?] I had a 
graduate course on research methods in second language teaching and one of the tasks 
the students had to do was observe a class and write up an observation report what was 
going on in the class. 


In the same vein, he criticizes his university’s attempts to monitor and check 
teachers’ performance because he feels they focus only on administrative 
issues, such as punctuality, rather than on pedagogical quality and 
effectiveness. In his part-time job, Jack is more content that the 
administration keeps a far closer eye on what teachers do, and that it is 
genuinely interested in the professional development of its teachers. 

Dave’s situation at work may be similar, although he is far less critical. 
Unlike Jack, however, he does not expect his university to provide training 
or opportunities for self-development. Nevertheless, Dave has found many 
opportunities to engage in self-development, both at work, through his 
ongoing academic study and through his activities with professional 
language teaching organizations. Significantly, Dave has been able to 
collaborate with colleagues because he has responded to their needs, while 
they in turn have granted him entry into the work community. 


10. Conclusion 

This paper has presented narratives from two teachers and makes no 
claims to offer comprehensive or universal claims about the nature or the 
content of expertise in Teaching English as a Foreign Language. What I have 
tried to indicate, however, is the importance of the social dimension of 
expertise, that is, how a teacher’s practice is valued by their students, by 
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colleagues and administrators, and how these external valuations fit or clash 
with the teacher’s own self-worth. 

As Jack and Dave illustrate, teachers do not necessarily define expertise, 
or best practice, in the same way. Jack, as we saw, was influenced by his 
experience of teaching adult immigrants in Canada to place a high value on 
meeting students’ real life needs and on an acknowledgement of his role in 
changing students’ lives. Dave, by contrast, has not had that kind of 
experience, or that degree of authority in his teaching career, and 
consequently, his ambitions are lower, and he experiences greater 
acknowledgement and success among his students and his local academic 
and professional communities. 

I conclude by returning again to Rampton’s (1990) definition of 
expertise. As we saw, Jack and Dave both feel that their expertise has been 
learned, through effort and experience, and both of them feel that this is an 
ongoing process. One of the reasons for Jack’s dissatisfaction, however, is a 
perception that he is valued more for his fixed or innate skill as a native 
speaker, than as an expert educator. In fact, Jack himself is somewhat 
ambivalent regarding this aspect, viewing his cultural and linguistic identity 
as something that does actually give his authority over his students. Both 
teachers, however, view their expertise more in their pedagogical knowledge 
and skill, and both express strong feelings towards various teachers in their 
past, on whom they have modelled their own practice. 

Finally, it must be conceded that the analysis of expertise in this study is 
complicated by the fact that both teachers are foreigners, members of a very 
small minority in Japanese higher education. In its social aspect, as Tsui 
(2003) has shown in her case studies of teachers, and as I have argued in this 
paper, expertise entails an ability to communicate with and to learn from the 
teaching context. The fact that Jack and Dave are foreigners in a Japanese 
educational context differs in degree but not in kind from the situation of 
teachers seeking approval and authority in any educational institution. From 
his accounts, Jack’s working environment does not appear to permit that kind 
of reciprocal openness, but Jack himself expresses a negativity towards it 
that keeps him at a distance. Dave, by contrast, has succeeded in breaking 
down barriers and being (at least feeling that he is) admitted as a member of 
his academic community. Dave is not oblivious to the considerable 
differences between the way he and his colleagues do things, but he does not 
criticise their practices or question their worth. In maintaining an open and 
positive attitude towards students and colleagues, he is able thus continually 
to apply and develop his expertise in ever new and challenging situations. 
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Learning to Reframe — Japanese 
Benefactives, Metalinguistic Beliefs and 
the Identities of L2 Users — 


Barbara PIZZICONI 


1. Background 

This paper examines the relationship between language learning and 
learners’ beliefs regarding it. It is based on learners’ “stories,” i.e., the 
learners’ own accounts of “meanings” in the target language—Japanese'—the 
meaning of one particular family of constructions—benefactive 
constructions—and the learners’ assessment of their own learning process. 

The original work on which this paper is based is a study of the patterns 
of acquisition of Japanese benefactive constructions by native speakers of 
English. In two previous papers, I have discussed the linguistic issues in 
non-native acquisition (Pizziconi 2000a) and the concept of “benefit” in the 
linguistic usage of native speakers of Japanese (Pizziconi 2000b). In this 
paper, I return to the discussion of the performance of non-native speakers; 
here, however, I will focus on metalinguistic knowledge and beliefs and what 
(if anything) they can reveal about the processes of language acquisition and 
acculturation’. 


: Here, the usual proviso is applicable to the language referred to in this study—the 
standard variety used in Japanese media and based on the dialect spoken in Tokyo. 

At the outset of this research, its theoretical orientation was considerably different from 
that illustrated in this and in the subsequent section; this is naturally reflected in my 
initial questions, data collection methodology, and terminological apparatus. However, 
the learners’ comments, which I had not previously discussed, appeared to provide further 
insights on the issues discussed in this paper. In fact, the analysis of the learners’ 
comments in relation to the data on actual performance appeared to indicate certain facts 
that could be understood in a different theoretical framework—the sociocognitive 
approach that I briefly outline in this paper. The study probably prompts more questions 
than it answers; yet, it is the first step in an exploration of a new perspective on how 
socially and culturally sensitive linguistic facts are learned and acquired. 
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The system of benefactive constructions in Japanese is an extremely 
intriguing subject of enquiry; this is because the acquisition of this system 
involves different types of skills—grammatical, sociolinguistic, pragmalinguistic, 
and sociopragmatic. Although some of these are often treated as separate 
entities in the literature on second language acquisition (SLA), it is obvious 
that the boundaries between them are far from definite (Kasper and Rose 
2002). 

A sociocultural approach to language learning, by virtue of its shift in 
focus from language as a self-contained and independent semiotic system to 
language as a tool for acculturation and socialization (Schumann 1976, Ochs 
and Shieffelin 1979), allows for a paradigm change due to which the 
traditional boundaries between disciplines are no longer taken for granted. 
Consequently, the interactional context is considered to be instrumental in 
the way in which grammar is interpreted and represented. Going a step 
further, some researchers have proposed that cognitive and affective factors 
are not dissociated from social factors; rather, from an ecological perspective 
on language-learning organisms, these factors are inextricably integrated in 
networks of relationships with other organisms in complex and rich 
interactive environments (Atkinson 2002) 

The affective and cognitive factors involved in language acquisition are 
often assumed to operate at a subconscious level; in this article, I explore the 
aspects of these factors that may emerge at the conscious level. Although I 
am interested in the ways in which social and cultural issues affect language 
learning, and while I subscribe to the (neo-Vygotskian) view that language is 
an intersubjective accomplishment, I will focus on the individual rather than 
the group, and on the manner in which the individual generates cognitive 
representations of the target language. 

The “voices” presented in this study no longer comprise raw data 
obtained from situated exchanges but are a necessarily subjective selection 
and (re)composition of diverse commentaries—with all the perils that this 
entails. This selection has aimed to highlight the fact that the relationship 
between learners’ folk theories about language learning and the available 
evidence on the actuality of their performance in that language is often 
ambiguous. 


2. Some working assumptions 

One of the widely held assumptions regarding successful SLA is that a 
positive attitude toward the target language is conducive to acquisition. 
Research on the relationship between language success and learners’ 
attitudes has addressed aspects such as attitudes toward the “self” (Siegal 
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1995, 1996, and her subjectivity theory’) and toward the native and target 
cultures (Schumann 1976). Currently, it is held that successful intercultural 
interaction requires “attitudes of curiosity and openness, of readiness to 
suspend disbelief and judgment with respect to others’ meanings, beliefs, and 
behaviors” (Byram 1997: 35), rather than merely “positive” attitudes. 
Understanding language as an integral part of culture entails an assumption 
that the very attitude toward the target language itself—inasmuch as it 
impacts on behavior—is a subject worthy of enquiry and is one of the factors 
that influence language learning processes and outcomes. Past studies have 
investigated learners’ epistemological beliefs regarding language and 
language learning (Mori 1999*, Wenden 2001); however, there still exist 
practically no references to the learners’ evaluative judgments of the 
pragmalinguistic features of the language and what (if anything) such 
features may reveal about their competence. 

Strong motivation is belived to be conducive to learning via the 
mediating role of attention. An interest in the target culture, the desire to 
integrate, and a positive appreciation of its language are considered to be 
accelerating factors. In contrast, it is generally accepted that among other 
factors, a negative opinion of the target language and culture can indirectly 
function as a “filter” (Krashen’s 1985 affective filter hypothesis articulates 
this) or an obstacle to the development (according to Schumann’s 1976 
model, a large social and psychological distance prevents acculturation). 
Given these working assumptions, it is possible to expect a tendential 
correlation between the learners’ attitudes toward the language (and in 
particular, any parts of it which may be conceptually novel to the learners) 
and their proficiency. Although I did not posit any strong hypothesis 
regarding a relationship between the number of benefactive structures used 
by the learners and their general proficiency, I did expect to observe a 
developmental curve in the learners’ ability to incorporate benefactives in 
their rhetorical repertoire, parallel to an increasingly positive or at least 


Siegal (1996: 360) delineates three qualities of subjectivity, all of which are relevant to 
the interpretation of the learners’ comments discussed in this paper: a. the multiple nature 
of the subject, b. subjectivity as the site of a struggle, and c. subjectivity as varying over 
time. The changes in a learner’s life influence his or her subjectivity, language use, and 
language awareness, and at any given time, different facets of the self may enter into 
complex relationships of conflict. Thus, the concept of subjectivity is both synchronically 
and diachronically dynamic. 

Mori suggests that “beliefs are in part a function of experience and learning,” and that 
“the nature of the learning experience could affect the formulation of student beliefs.” 
These are themes that will resurface in my discussion. My commentaries will also show 
that “Advanced language learners are less likely to believe in simple, unambiguous 
knowledge or in the existence of absolute truth than novice learners” (1999: 409). 
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“open” metalinguistic evaluation. Later discussions will reveal that this 
assumption was not corroborated by the data, at least not in any 
straightforward way. 

Another important assumption, motivated by recent developments in 
linguistic theorizing, that, in my opinion, is relevant to the examination of 
benefactive constructions, is that if grammar is considered to be “rooted in, 
and shaped by, everyday language use” (Ford et al 2003: 119) rather than an 
independent module of human cognition (Hopper 1998, Marmaridou 2000), 
then language acquisition should be conceived of as an interactionally 
sensitive process. 

Several recent approaches to language acquisition consider interaction 
as having a foundational role in the development of a grammar (Kasper and 
Rose 2002: 166). Some examples of such approaches are Long’s (1996) 
interaction hypothesis, Swain’s (1995) output hypothesis, Lantolf’s (1999) 
sociocultural theory, and the various views on language socialization (see 
Lantolf and Pavlenko 2001, Atkinson 2002 for a review). According to these 
recent views, a crucial trigger of grammatical competence is not abstract 
syntactic rules, productive as they may be, (in fact, the sociocognitive 
approach to grammar entirely transforms the way in which rules are 
conceived), but the conventionality (or rather institutionality) of the social 
uses of language. These uses themselves are embedded in the grammar of the 
language: “We understand grammar as a minimally sorted and organized set 
of memories of what people have heard and repeated over a lifetime of 
language use, a set of forms, patterns, and practices that have arisen to serve 
the most recurrent functions that speakers find the need to fulfill” (Ford et al. 
2003: 122). In these models, collocations, frequent repetition, and routines 
that are crystallized into ‘rituals’ are assigned a crucial “scaffolding” 
function, and the sociocultural milieu in which such linguistic interactions 
occur is considered as a structuring factor rather than as a mere background. 

The type of learner data presented in this paper allows for the 
speculation that the acquisition of benefactive constructions—linguistic 
structures that have social and affective correlates—is facilitated by exposure 
to social practices (i.e., behavior that is routinized rather than analyzed), and 
that routines can become far superior carriers of social meanings than 
metalinguistic explanations, overt instruction, or normative prescription. 
Routines, in fact, do become such carriers despite the apparently conflicting 
beliefs of the learners regarding the putative meanings of grammatical 
constructions. A previous analysis (Pizziconi 2000a) provided no support to 
the claim that syntactic competence must precede pragmatics. In this paper, 
I claim that the pragmatic meaning expressed by the benefactives must be 
learned anew in Japanese, and I propose that participation in meaningful 
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social interactions in varied contexts is what allows for the emergence and 
establishment of new “labels” for events. Thus, this participation is likely to 
drive lexico-syntactic competence. I maintain that a productive use of 
benefactives—a truly “pragmatic” use that allows the expression of the 
learner’s subjective (strategic) intention rather than an imitative, formulaic 
use—is based on formulaic use and made possible by the processes of 
delexicalization’ and routinization (Hopper 1998: 159)’. 

Prior to illustrating the way in which the learners’ metalinguistic 
comments and narratives may support this view of language learning, I will 
first provide an overview of my own characterization of these structures in 
the subsequent section; I will then examine the learners’ comments and 
describe their own characterizations. Despite the terminological differences, 
it will be observed that the subtlety of the learners’ representations increases 
with an increase in their proficiency. However, despite their increasingly 
sophisticated descriptions, the analysis will reveal the way in which different 
goals within lay discourse can be responsible for the apparent inconsistencies 
in reported and actual behavior and in the meanings attributed to these 
structures. Further, the analysis will reveal the different agendas 
underpinning such lay discourse, and scholarly discourse. 


3. A linguistic account of benefactive constructions 

I have previously referred to benefactives as structures that characterize 
a communicative style (Pizziconi 2000b). I use the term “communicative 
style” to refer to a culturally preferred representation of events—a “way of 
speaking” a-la Hymes—that is available to the speakers of a language on 
account of the existence of specific linguistic resources. Like Maynard 
(1998), I have also referred to benefactives as “habitual conceptualizations of 


Skehan (1998: 33) argues for a reevaluation of the role of the lexicon in language studies 
as against what he considers an overemphasis on the rule-governed basis of language use; 
he then quotes Sinclair (1991 Corpus, Concordance Collocation, Oxford University 
Press) in order to point out that “many frequent words become delexicalized since they 
enter into frequent collocations and phrases, and they lose meaning to the collocation as a 
whole.” Benefactive constructions are one such case in which the frequent occurrence of 
benefactive verbs as auxiliaries results in a dimming of their “literal” meanings and 
generates a quickly recognisable—because formulaic—frame that adds a modal nuance 
to the event depicted. 

“The fading of existential-situational meaning that occurs when any action is constantly 
repeated until it becomes routine...we say things that have been said before. Our speech 
is a vast collection of hand-me-downs that reaches back in time to the beginning of 
language. The aggregation of changes that are made to this inheritance on each individual 
occasion of use results in a constant erosion and replacement of the sediment of usage, 
which is called grammar.” 
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the world” (ibid.); if language is, in a broad sense, responsible for the 
specific “ways” in which we construe reality, benefactives contribute to this 
specific cognitive “construal” by requiring the speaker to position 
him/herself in a modal dimension that involves a computation of 
interactional effects and an indication of the polarity of those effects (i.e., 
whether they are positive or negative). It would be useful to characterize 
benefactive construals as extremely productive “metaphors” (Lakoff and 
Johnson 1980), as a kind of rhetorical device that allows speakers to 
construct a particular (and, crucially, culturally specific) conceptualization of 
human relations. 

While I will refrain from engaging in cross-linguistic comparisons, 
which are beyond the scope of this study, I would like to mention that it has 
been noted (e.g., Mizutani 1985) that the Japanese language commonly 
utilizes enunciations that picture a scene from the perspective of the 
experiencer (b. in the examples below) rather than from that of an ideal 
external spectator (a.). As an extension of this tendency—evident in cases 
like b. or d—the grammar allows for the specification of a quality of these 
subjective experiences. The beneficial framing is a characterization that is 
available to and routinely exploited by Japanese speakers (Clancy 1985, 
Miyachi 1999’) because of the availability of benefactive structures’. 


a. Someone took my wallet (OBJECTIVE) 
SUBJ Vv OBJ 
b. (watashi wa) saifu wo toraremashita (SUBJECTIVE) 
I-SUBJ. OBJ V-PASS 
My wallet was stolen (lit. I was stolen my wallet) 


c. My pupils worked hard (OBJECTIVE) 
SUBJ Vv ADV 


: Although I will not discuss the diachronic development of these structures, I will, at this 
point, take note of Miyachi’s (1999) suggestive observation that this is a phenomenon 
that characterizes modern Japanese based on some evidence that such forms were not 
systematically employed until the mid-seventeenth century. The fact that at that very time, 
Japanese honorifics undergo a general trend toward simplification is, for Miyachi, not a 
casual co-occurrence. He notes that the trend toward a focus on the interactional 
dimension (i.e., the speaker’s interlocutor) and the situation-specific assessment of the 
relationship between the self and others provide the motivation for the birth of this new 
cognitive and linguistic construal. 

Note that even when an utterance is constructed in an active voice and with a third person 
subject as in d., the benefactive auxiliary allows the entire construction to be anchored to 
the speaker’s point of view. 
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d. seito ga ganbatte kureta (SUBJECTIVE) 
SUBJ V-TE BEN 
(My) pupils worked hard for me. 


(SUBJ = subject; v = verb; OBJ = object; PASS = passive; ADV = adverb; BEN = benefactive). 


Therefore, benefactives and “adversative” passives resemble two sides 
of the same coin, i.e., of a “modality of interactional effects” or of 
“jnterconnectedness.” ° The “modality of interconnectedness” is a structural 
resource (linguistically embodied in benefactive verbs and passive 
constructions) by virtue of which speakers can characterize or frame actions 
and events in the external world as facts with a special subjective relevance 
for individuals as opposed to facts that are depicted “objectively.” 
Benefactive constructions achieve this effect by means of the metaphorical 
use of a set of giving-receiving verbs. On account of these linguistic forms, 
speakers are able to activate a frame of benefit transfer, and therefore, are 
able to index an affective stance of “interconnectedness.” I use the term 
“frame” in the Fillmorean sense, i.e., in reference to something that 
characterizes a “small abstract ‘scene’ or ‘situation,’” and that is activated by 
words such that the meaning of one word cannot be understood unless the 
entire scene is envisioned. This includes understanding the “social 
institutions or the structures of experience which they presuppose” (Fillmore 
1982: 116)'°. Benefactive constructions can neither be understood nor 
employed appropriately unless one is able to grasp the schematization of 
judgments and values that trigger the use of benefactive verbs, including the 
metaphorical conceptualization of “benefit” (see Shibatani 1996 for a 
discussion of the “give” schema activated by benefactives"’). In this sense, it 


See, for example, Ohtani (2005) for a discussion of important syntactic/morphological 
similarities and differences. 

In addition, a frame is “a system of categories structured in accordance with some 
motivating context,” where a motivating context is “some body of understanding, some 
pattern of practices, or some history of social institutions, against which we find 
intelligible the creation of a particular category in the history of the language 
community.” 

His discussion mainly addresses natural semantic conditions such as the following: (1.) 
the NP2 is human, (2.) possessive control is interpreted abstractly (metonymic relations), 
and (3.) a possessive situation is created by someone other than the possessor and is 
construed such that it is beneficial to the possessor. However, Shibatani reveals the 
importance of local conventions when he observes that the acceptability of benefactive 
constructions varies according to the conventionality of the situation (“sweeping the 
floor” is more prototypical than, e.g., “smashing a centipede”). 


10 


11 


Presented by: https://jafrilibrary.com 


Presented by: https://jafrilibrary.com 


126 ~— Barbara PIZZICONI 


is necessary to presuppose a certain kind of cognitive representation of 
reality that is codified in terms of benefit-generating actions (or conversely, 
as in the case of the complementary affective category of passive 
constructions, adversative-effects-generating actions), benefactors, and 
beneficiaries. This frame is institutional in nature to the extent that it is a 
socially constructed and conventionalized (though not unmodifiable) frame. 

Besides broadly defining these verbs as indices of a communicative 
style, I would like to highlight two important anti-deterministic 
qualifications. The first is that I make no claims regarding the obligatoriness 
of benefactive structures in the use of Japanese (see Pizziconi, 2000b). A 
speaker of Japanese may or may not choose to use these structures to 
characterize an action or event. The claim that the use of these constructions 
defines a communicative style is primarily derived from the existence of a 
pragmatic (and cognitive) category that is linked to a set of contrasting 
syntactic options; consequently, these options require/allow speakers to make 
such an expressive choice. Moreover, it is based on personal observations 
that these structures are, in fact, routinely deployed in the whole range of 
their meanings (Pizziconi 2000b). The degree to which a speaker exercises 
control over these constructions varies and depends on the specifics of the 
utterance (whether or not spatial deixis is necessary, whether the viewpoint is 
constrained by the discourse, etc., see below). However, there exist other 
means by which to maintain deictic specifications when the benefactive 
implication is not intended; conversely, whenever a speaker chooses to adopt 
a benefactive construction, s/he also consciously allows for the possibility of 
that implication, even if it is not foregrounded. 

Second, I envisage no direct link between the use of benefactive 
structures and the indexing of a specific cultural identity (e.g., a Japanese 
“interdependent” sense of self). In other words, I wish to avoid making any 
claims regarding Japanese identity based merely on the fact that the Japanese 
language uses benefactive constructions. These structures merely serve to 
index an affective stance that acknowledges/construes the effects of an 
individual’s actions on another as beneficial. As previously mentioned, this 
fact may occasionally be emphasized, while at other times, it may merely be 
an interpretation that is available but not salient. Whether or not such a 
stance is associated with more general identity traits is inevitably and equally 
dependent on situational factors as well as individual beliefs and ideologies 
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and hence, is contextually variable’*. Nevertheless, the very existence of 
these grammatical constructions permits or determines the speakers’ recourse 
to such conceptual characterizations and simultaneously establishes or 
maintains the specific cultural practice of explicitly signaling the positive or 
negative effects of certain types of human and natural events on individuals. 
It is not difficult to understand the way in which this practice may very easily 
lend itself to the marking of identities. 

The system of benefactives in Japanese has no structural equivalent in 
English, and consequently, it is dealt with in rather explicit terms in 
textbooks and in the classroom. Benefactives are often presented as a 
semantically homogenous and systemically organized conceptual category, 
as is seen in the following table. 


GIVE RECEIVE 

SUBJ (SPEAKER) > IND. OBJECT | SUBJ > IND. OBJECT (SPEAKER) | SUBJ < IND. OBJECT 
(V-te) yaru 

(V-te) ageru (V-te) kureru (V-te) morau 

(V-te) sashiageru* (V-te) kudasaru* (V-te) itadaku* 


* = +HONORIFIC (lexical substitutes); (SPEAKER): includes ingroups 


Benefactives, or “giving-receiving” verbs (yarimorai or juju dooshi), 
are a set of seven verbs in various relationships of contrast at the syntactic, 
semantic, and pragmatic levels. 

Benefactives can be used independently or as auxiliaries. They are 
extremely productive when used as auxiliaries and can be used in 
conjunction with almost any verb; they carry the connotation that the action 
or event described by the main verb has a certain beneficial effect on a 
determined recipient. To the extent that these constructions are fully 
grammaticalized, their lexical meaning is (or can be) “dimmed” (see Note 5), 


2 This characterization of the relationship between linguistic forms, stances, and identities 
is considerably indebted to the work of Elinor Ochs, and in particular, her papers 
published in 2002 and 1996 as well as her categorization of the four interdependent 
dimensions of action, stance, identity, and activity. She states that “...social groups 
associate particular stances with particular actions, associate these linked stances and acts 
with particular social identities and activities, and associate particular activities with 
social identities.” (2002: 110). Identities and activities are more complex and are believed 
to entail particular actions and stances. On account of space constraints, in this paper, I 
will not elaborate on the types of institutional activities that are systematically linked to 
the particular stances that a speaker can instantiate through honorifics; however, it is easy 
to find many examples of this occurrence in customer-oriented discourse and 
advertisements. Similarly, one can easily observe the way in which they are linked to 
actions, for example, requests or appeals. 
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and the meaning of the overall structure can be glossed as follows: 
something happened to an individual [and s/he was positively affected by 
lees 

As space constraints do not permit me to present a detailed description 
of the system, it is natural that the extremely concise outline that follows 
overlooks many important details. My discussion will also pay little attention 
to ageru/sashiageru, which appeared in only a few instances in the data, and 
which have distinct sociopragmatic constraints. 

Previous accounts (see Shibatani 1996, Clancy 1986 for extensive 
discussions) have discussed syntactic, semantic, and acquisitional issues. 
While I subscribe to a cognitive, schema-based account such as that explored 
by Shibatani (1996), I would like to add the observation that the use of 
benefactive constructions generates meaning on the basis of not only the 
semantics of the benefactive verbs but also the contrastive network or the 
relationships in which each benefactive participates. 


1. AgaB wo shootai suru A invited B 

2. AgaB wo shootai shite kureru A invited B (+BEN) 

3. AgaB ni shootai shite morau A was invited by B (+BEN) 
4. AgaB ni shootai sareru A was invited by B 

5. AgaB wo shootai shite kudasaru A invited B (+BEN) (+HON) 
6. Aga Bnishootai shite itadaku A invited B (+BEN) (+HOoN) 


From a syntactical point of view, 1 is opposed to 4 (while both are 
non-benefactive structures) in that the former is in the active voice while the 
latter is in the passive voice. A similar opposition exists between 2 and 3 
(note the grammatical cases, with kureru as an active-like structure and 
morau, a_passive-like structure, cf Murakami 1986, Ohtani 2005). 
Semantically and/or pragmatically, 2 and 3 contrast with 1 and 4 in that a 
benefactive component is added to the same propositional meaning (when 4 
is a “syntactic” passive); 2 and 3 are in opposition to 1 in that they interpret 


Therefore, by utilizing such grammatical structures, it is possible to say that an individual 
gave you something or did something for you, or also, more peculiarly, that an 
intransitive action has a positive effect on or yields a positive outcome for you. The 
semantics of the verbs is based on the “give” constructions (Shibatani 1996); hence, the 
frame that is activated is that of the transfer of beneficial actions. While describing events 
in this manner, the speaker must also make an additional choice of frame that concerns 
his/her social or affective deixis, i.e., which of the participants, besides the speaker 
him/herself, to empathize with in the description of benefactors and beneficiaries. This is 
because the “giving” verbs ageru/kureru and sashiageru/kudasaru are not symmetrical 
and they impose constraints on the referents who can act as givers or recipients. See a. in 
the list of functions illustrated below. 


Presented by: https://jafrilibrary.com 


Presented by: https://jafrilibrary.com 


Learning to Reframe 129 


the context in terms of benefit rather than a lack of it, and are in opposition 
to 4 with regard to the benefactive vs. malefactive effects of the proposition 
(when 4 is an “adversative” passive). An additional (and less problematic) 
distinction is that between the non-honorific constructions in 2 and 3 and the 
honorific constructions in 5 and 6. Coincidentally, this distinction serves to 
demonstrate the inadequacy of the analyses of the distinct levels of syntax, 
semantics, pragmatics, or sociolinguistics in their attempt to describe the 
multiple dimensions that are simultaneously called into question in the 
choice of either term. 

The following is a brief account of the manifold functions of 
benefactives (reproduced from Pizziconi 2000a). Benefactives contribute to 
the determination of the following: 


a. A spatial deixis, i.e., who does what for/to whom. This aspect is particularly 
relevant since the use of pronouns is not crucial to the construction of an utterance. 
For example, invitations that are “sent out” are marked with ageru, and those that 
are received are marked with kureru, as in “shootai shite ageru/kureru.”, in 
which it is not necessary for any referents to be explicitly marked; these are 
additionally constrained by specifications of the speakers’ empathy toward other 
participants. Speakers and their ingroups can only be referenced by the subjects in 
ageru constructions and the indirect objects in kureru constructions; therefore, by 
choosing either one of these auxiliaries, it is possible to indirectly index “whose 
side an individual is on.” This is linked to the choice of: 

b. voice/viewpoint in the active-like “give” and _ passive-like “receive” 
constructions’. For example, “A ga B wo shootai shite kureta” (A invited B) is 
an active-like construction wherein A is an agent. This is in contrast to “A ga B ni 
shootai shite moratta” (A was invited by B), wherein A is the patient. 

c. Viewpoint cohesion, the absence of which leads to the formation of nejirebun (lit. 
twisted sentences), i.e., ill-formed sentences with more than one viewpoint 
(Tanaka 1999). For example, “tomodachi ga haruyasumi ni ryookoo ni ikanai ka 
to sasotta keredomo, okane mo hima mo nai node kotowatta” (my friend invited 
[me] to travel with him during the spring holidays, but I declined since I cannot 
afford it). On account of the absence of the verb kureru, the event cannot be 
anchored to the speaker and consequently, the viewpoint of the two clauses is 
“disjointed.” 

d. social deixis, through the selection of honorific or non-honorific options. For 
example, “A ga B wo shootai shite kureta vs. kudasatta” (A invited [+honorific] 


i, anne ene : eT, F : 
Pizziconi (2000a) finds that learners appear to prefer active-like constructions. This may 
be related to the preference for objective constructions, like those mentioned above, in 
the English language. 
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B), “A ga B ni shootai shite moratta vs. itadaita” (A was invited [+honorific] by 
B). 

e. affective stance, which evokes an acknowledgement of interconnectedness or 
indebtedness. For example, “Taroo wa su-pa- e itta” vs. “itte yatta” (“Taroo went 
to the supermarket” vs. “Taroo went to the supermarket for her”). 


Benefactive constructions are therefore motivated at different levels, 
have different constraints on their use, and can be “read” more or less 
literally (i.e., their “prototypical” benefactive meanings can be more or less 
salient)’®. The use of a benefactive implies the potential availability of 
grammatical, social, and affective meanings on account of the paradigmatic 
relations in which they participate; the relevant meaning must be determined 
by the participants on the basis of the situational context. 

Incidentally, it should also be noted that benefactives participate in a 
number of conventional (deferential) structures such as [causative + 
morau/itadaku] (e.g., “setsumei sasete itadakimasu,” i.e., “I’ll explain this 
for you”), formulaic expressions (e.g., “ippantekina goshitsumon ni 
tsukimashite wa kochira wo sanshoo shite toiawasete_kudasaimasu_yoo 
onegai_itashimasu,” i.e., “please refer to this for general enquiries,” and 
idioms such as “nosete itadaku,” which means “to get a lift.” (The last 
example, in fact, appears in my database.) Although there are different types 
of constraints on the use of all these expressions, it is natural that they make 
significant contributions to the process of learning the constructions. This is 
because they not only make the constructions more visible and salient, given 
their common occurrence, but also more memorable. They can be used as 
readymade chunks before the learners are able to analyze their structures. 

Given the variety of the possible valences of a benefactive form, 
learners are faced with the task of determining the extent to which these 
meanings should be interpreted literally (Byalistock 1993). Socialization is 
functional to the development of a “commonsense” interpretation on the 
basis of the knowledge of conventional conceptualizations that is derived 
from extended use. To the extent that it is necessary for the individual to 
learn which actions can be taken for granted and which can be consigned a 
beneficial characterization (see Pizziconi 2000b), it is clear that it is a 
socially constructed perception of reality that is being referred to; therefore, 
it is possible to assume that the “benefit metaphor” is acquired through 
participation in activities in which it has a social significance. If this is a 


15 quaue ene ac F : 
However, when these auxiliaries are used with intransitive verbs—which are peripheral 
cases of the use of the prototypical “give” construction, as in Shibatani’s (1996) 
analysis—the preferred reading would have to be that of the “affective” meaning. 
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habit as described by Bordieau—a disposition that one develops through 
extensive exposure to learned worldviews—it would be expected to emerge 
from discourse rather than from texts (see Widdowson 1995: 70, Roberts et 
al. 2001: 80), and from dynamic exchanges (even in the social context of a 
classroom) rather than from structurally-oriented books on grammar. 

In order to reveal the significance of this cognitive construal, it may 
appear natural or even unavoidable to take recourse to a cultural notion of 
“interconnectedness” (the concept that action and language are “regulated” 
by the same conceptual system is the tenet of cognitive approaches to 
metaphor, e.g., Lakoff and Johnson 1980), especially if this construal is 
perceived to be “in sync” with the cultural metalanguage of social 
relationships in Japan’®. However, I will argue against a pedagogic treatment 
of these structures, which disentangles their lexical meaning from the 
network of other meanings described above and encourages an essentialist 
reading of the metaphor. The adoption of such an approach serves only to 
reinforce cultural stereotypes which although unlikely to entirely prevent 
acquisition are, nevertheless, likely to retard or unnecessarily hinder it, and 
does not contribute in anyway to the acquisition of the structures as a 
linguistic resource. Learning the metaphorical structuring of a language both 
cognitively and affectively is crucial because “‘metaphorical imagination is a 
crucial skill in creating rapport’ [sic: quoted but not referenced]. 
Understanding how a speaker is using a metaphor—the webs of meaning 
entangled in it—is a way of sharing and perhaps empathizing with that 
speaker” (Roberts et al. 2001: 73). However, we will examine the way in 
which undue stress on the cultural correlates to the expenses of the mere 
linguistic convention may result in a clash with the cultural values of the 
learners and provoke unnecessary misconceptions. 

In this paper, I have listed the functions of benefactives, ranging from 
those that are the most syntactically motivated to those that are the most 
expressive and affectively motivated. This type of cataloguing inverts the 


- I am referring to the extensive literature on the collectivistic, hierarchy-conscious, 
duty-bound Japanese orientations and their reflections in the individual sphere: situated 
selves, inter-subjective psychology, and dependency bonds. Works with this orientation 
are too numerous to be listed here; however, there is very little literature that discusses 
the ideological underpinning of these constructs. One of the few notable exceptions is 
Befu Harumi’s recent (2001) Hegemony of Homogeneity, Trans Pacific Press. 
Nevertheless, such deconstructions must question the “innateness” or the motivation for, 
rather than the existence of practices (for example, greetings on subsequent encounters 
that acknowledge previous exchanges: “kono aida doomo...”) through which individuals 
are socialized to be sensitive to their social surroundings and to the manner in which a 
wide range of actions is likely to affect them. The generation of chains of indebtedness 
and sustained interrelatedness is an example of the way in which this occurs. 
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phylogeny of these constructions (whereby the auxiliary use is derived from 
the metaphorical adoption of the lexical value of the benefactives in 
increasingly generalized structures, to the extent that the benefactives 
become entirely grammaticalized and their original lexical meaning is 
diluted as a result of routinized usage, Hiraga 1999, Hopper 1998). In 
contrast to the received conceptualization of these constructions by native 
speakers (NS), who appear to be largely unconcerned with the literal 
meaning of the auxiliaries in complex constructions, learners (arguably on 
account of teaching conventions as well) appear to be fairly sensitive to the 
lexical meaning of the auxiliaries. As is evident in their reports, this is the 
factor responsible for triggering and, crucially, constraining the learners’ 
interpretation of constructions in the early stages of learning. 


4. The learners’ metalinguistic comments 

Pizziconi’s (2000a"’) analysis concluded that no direct progression from 
the compulsory to optional uses of benefactives could be observed, and that 
the semantics of the verbs were of no assistance in the prediction of the 
accuracy of the use’*. However, in general (albeit only at the group level), 
both the frequency and accuracy of use increased with the proficiency of the 
learners, thereby demonstrating their increasing ability to adjust to the 
preferred conceptualization (and mapping) of the target language/culture. 
This is evident in Chart 1. Note that this is no more than an indication of the 


'” The method included an oral interview and a written questionnaire (both in Japanese) as 
well as a post-test discussion. In this discussion, the beginners/intermediate learners 
invariably chose to speak in English, while the advanced learners frequently switched 
between English and Japanese. In these post-task sessions, I accumulated further 
demographic data, obtained clarifications of the learners’ intended meanings in the 
interview, disclosed the object of my research to them, and documented their 
metalinguistic judgements regarding the constructions and their own performance. There 
was an extensive variation in the scope of these judgements, depending on the students’ 
proficiency level, the time available to them, and other personal factors. 

Pizziconi (2000a) assumed that compulsory uses, i.e., uses demanded by the need for 
deictic specification, would be more “transparent” to learners than those that were more 
“volatile,” i.e., the optional and subjective expressions of affective stances. At present, 
this assumption, which was based on an NS’s parameter of grammatical normativity 
rather than on a universal criterion of semantic clarity, appears to be slightly arbitrary. 
The perplexing results, which indicated that even advanced users could “miss” the 
obligatory uses and that some beginners could produce non-obligatory constructions, may 
be advantageously revisited from the viewpoint of the ease of the association of a 
determined situation with a frame of “benefit transfer.” Although cases such as the use of 
benefactives in the case of oshieru (to teach) or their avoidance in the case of sagasu (to 
search) could perhaps be explained from this viewpoint, their use in conjunction with 
noseru (literally, “to put on,” but appearing in a formulaic use in “to give a lift”) and their 
avoidance in conjunction with shookai suru (to introduce) would, nevertheless, require 
another explanation. 


18 
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learners’ increased “confidence”; a qualitative description of expressive uses 
must be examined individually. I will refer to the specific uses while 
illustrating the data. 


novice 


intermediate 


advanced 


0% 20% 40% 60% 80% 100% 


Ocorrect Elmisused Elavoided 


Chart 1. BENEFACTIVES USED/MISUSED"/AVOIDED PER PROFICIENCY LEVEL 
(NNS) 


The chart provided below compares the frequency of the use of 
benefactives among both the native speakers and learners during the 
30-minute slots in which the interviews were conducted. The apparently 
curious result that the use of benefactives among advanced learners exceeds 
that among native speakers will also be discussed in the next section. 


19 “Misuse” indicates the use of kureru in place of morau or vice versa, or the use of 
inaccurate case particles, e.g., [Int: did you call by yourself?] and NNS21: “...*okaasan 
kara tasukete kuremashita” [*from my mother (I) helped]. 
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NS (10) 


advanced (7) 


intermediates (11) 


novices (4) 


Chart 2, AVERAGE NUMBER OF OCCURRENCES X GROUP (NS = 10, NNS = 22) 


I will now present and discuss the learners’ own metalinguistic 


comments. Although this study focuses on the learners’ metalinguistic 
assessments of their performance in the use of these constructions and not on 
their actual use, their performance at the group level and their individual 
performance when individual cases are presented will also be illustrated 
briefly in order to place their comments in perspective”. Occasionally, I will 
also quote remarks made by my own students, who participated in a survey 
on benefactives on a speparate occasion”. This is done in order to 


20 


21 


The constructions analyzed in the two previous studies, which are the targets of the 
learners’ comments, are those in which benefactives appear as auxiliaries and those in 
discourse that reports events rather than direct appeals, i.e., discourse in which many 
requestive expressions are conventionally used in conjunction with benefactives. In this 
study, the students are identified by numbers, e.g., NNSn. 

These are retrospective comments that were obtained in a written survey conducted in 
2004, in which second and third year students participated voluntarily. Here, these 
comments are used to illustrate certain general points from the perspective of the learners 
rather than that of the researchers. The survey comprised the following five questions: 
(1.) did you find benefactives difficult to understand when you first learned them in year 
one?; (2.) did you start using them straight away or did you tend to avoid them in the 
beginning and use other expressions instead (if so, which expressions)?; (3.) do you use 
them naturally in conversation now?; (4.) what do these verbs tell you about the 
“Japanese” way of speaking?; (5.) anything you want to say about learning and using 
these forms. Students in this study are indicarted by a coded initial, e.g., XYn. 
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corroborate some of my observations with the learners’ own statements. 


Beginners and Intermediate learners 

Beginners do, occasionally, use benefactive constructions (three 
instances of use per subject as opposed to an average of 7.5 instances of use 
per subject across all proficiency groups); however they use them less 
frequently and less fluently than other groups (see Pizziconi 2000a: 9). These 
learners are, in general, aware of the structural properties of benefactives. 
However, they do, at best, possess passive competence, or active competence 
below the level of automization (see McLaughlin 1987) that is necessary to 
employ benefactives fluently in the course of natural conversation. More 
important, the metalinguistic awareness of these learners is also largely 
confined to an awareness of the formal properties of the constructions, while 
pragmalinguistic functions are, in general, neglected. 

This group utilizes benefactive constructions merely in contexts that are 
semantically clear (yuujin ga oshiete kureta = my friend told me/explained to 
me) or in formulaic, idiomatic uses (Kyoto made nosete moratta = I was 
given a lift to Kyoto). 

In general, the manner in which this group constructs an utterance 
continues, to a great extent, to reflect the cognitive conceptualization—and 
the linguistic instantiation—of their native tongue’s system. Consequently, 
the use of benefactives is never motivated by an attempt to produce a 
framing of events foregrounding affect, and it can be “forgotten” even in 
contexts wherein the syntax of Japanese obligatorily requires deictic 
specifications. The existing cognitive frames show no indication of being 
modified or restructured irrespective of the circumstance—learners are likely 
to adapt new information to what they already know—in the same way that 
the conceptualizations of the target language are shaped by those of the 
native language, “target language words and structures are ‘carriers’ of [...] 
native concepts” (Danesi 1993, quoted in Lantolf 1999: 43). Beginners, who 
are engaged in the extremely demanding task of extracting meaning from the 
overwhelmingly rapid stream of speech, can barely attend to the 
technicalities of such complex functional structures. One of the beginners 
participating in this research (NNS6) made a rather revealing statement when 
he commented on his use of benefactives as follows: “I don’t pay attention to 
them unless I am in an exam.” 

The particular framing conveyed through the use of benefactives—which 
necessarily involves the adoption of a specific affective stance—is likely to 
be redundant information unless some other intervening factors permit a shift 
in attention to target language-specific meanings and forms. Such factors 
must include both a procedural dimension, i.e., the freeing of additional 
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attentional resources from low-level manipulations by means of repeated 
practice, and a social dimension, namely, the “shared experience” that serves 
as the foundation for the establishment of networks of new conceptual 
categories. (Roberts et al. 2001: 59). 

The following are extracts from the post-interview discussions, in 
which a Japanese NS research assistant and I would typically ask learners to 
repeat—in English—descriptions of events that they had previously 
recounted in Japanese (which prompted or could have prompted the use of a 
benefactive construction). This procedure was an attempt to outline a 
rationale for the use, or lack thereof, of these constructions and to obtain 
information on the way in which the students learned them, etc. 


NNS15 is a female American learner aged 20 years, who studied Japanese for six years in 
Hawaii and spent three months in Japan. Her proficiency level has been assessed as Novice+. 
She displays an almost total avoidance of benefactives in both the written questionnaire and 
the oral interview”. 


Int: Do you use them consciously? Or do you avoid them consciously? 

NNS15: Sometimes, I try and use them but they are really hard and it’s very confusing 
between ageru and kureru.... I avoid using morau because I always think I can 
substitute kureru with morau, or if not I just switch the ageru people around. 


Int: Normally, morau is not considered to be complex because it is more like 
‘receive,”**, whereas with kureru the subject has to be someone else, so you have 
to be talking about that person....But you seem to think morau is more complex.... 

NNS15: yes, because...you see, with kureru you can just use that one person, and be 
stating that “you or someone really close to you receives something from that 
person” , whereas I think pronouns are what make morau and kureru all that more 
complicated, because you have “somebody ga somebody ni...”, and it is really 
hard to relate the two pronouns plus which verb to use. 


Int: Do these verbs allow you to express what you want, or you simply don’t use them 
frequently because you don’t think about them... 

NNS15: I don’t think I use them that frequently but...it really helps to express more...Its a 
good thing that we have, because it allows you to say something without creating a 


separate sentence, but...it is really difficult to use so sometimes I do try to avoid it. 


*2 She uses kureru in only three instances; in fact, in two of these, kureru is mistakenly used 
in place of morau. 
Morau (to receive) places no pragmalinguistic constraints on the choice of the subject. In 
contrast, ageru (to give “outward”) can only be used when the speaker (or someone in his 
ingroup) is also the giver, and kureru (to give “inward”) can only be used in the specular 
cases in which the speaker or someone in his ingroup is the receiver. 
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Int: Can you think of any other device that you use in their place, to show you are 
appreciating their doing something for you... 

NNS15: I would only use something like...“she did something and I was ureshikatta 
{=happy]”. [...] But then we spent...maybe a day in several months on them... 


Despite their praise for the economy of this device, these “novice” 
learners are generally unable to recognize the occurrences of benefactives 
and are also aware that they lack control over the use of these verbs; 
therefore, they actively resort to avoidance strategies or rely on “safe” 
formulaic uses. This is a cognitive limitation that probably cannot be 
changed by instruction to the same extent that it can be changed by repeated 
practice. One of my own students reports the following strategy: 


DK3: It is was a fairly safe bet to stick with set phrases such as “-te itadakemasen ka” or 


“-te kudasai” for requests and just to avoid using them at all when talking about third 
parties etc. 


Moreover, perhaps as a consequence of this limitation, the evaluations 
of the learners at this stage were not very detailed but merely comprised a 
general expression of confusion with regard to this rather complex set of 
grammatical structures. 


JM2: Yes, in fact it is quite confusing that there are so_ many words for “give” and 


“receive,” and the use of the “ni” (which can mean “to” or “from” in English) is quite 


tricky. 


The fact that it is possible to construct multiple, similar benefactive 
phrasings does, indeed, result in much confusion—pragmalinguistic nuances 
are difficult to tease out even when the syntactic workings have been 
clarified. 


AB3: I wouldnt say difficult to understand but sometimes I used to confuse the meaning, 
for example: ‘tomodachi ni kami wo kitte moratta’ and ‘tomodachi ga kami wo kitte kureta,’ 


I wasn‘ sure if that meant that ‘I_asked my friend to cut my hair for me’ or ‘my friend just 


did it of his own intention’. 


JHMA: Didn't start using them straight away. Generally, I just used the main verb with no 
indication of who was doing what for what [sic]. I often would realise later that I could have 


used them and then would work out what i could have said. However, this didn’t mean that I 


used them easily on future occasions—I’m not sure why. 
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Along with this obvious procedural tentativeness, the most significant 
cognitive distinction between the beginner group and the high-proficiency 
group is that the beginners prefer to restrict themselves to “familiar” 
meanings, in the expression of which there is neither the space nor the need 
for yarimorai constructions. 

The use of benefactives among intermediate-level learners is slightly 
more frequent (approximately 4.5 uses per subject); however, their 
performance continues to lack confidence (along with 40 correct uses, 18 
instances of avoidance and 13 of misuse were documented in all). 
Nevertheless, it is possible to observe cases such as the following, in which 
benefactives appear to have fallen entirely into a “blind spot.” 


NNS14: This lower-intermediate level learner represents an extreme case at the lower end 
of the spectrum since he entirely avoided benefactive constructions in both the written 
questionnaire and the oral interview (for example, his response to the interviewer’s question 
sensei wa naoshite kureru_n desu ka [does the teacher correct+BEN you?], was iie, 
naoshimasen [no, he does not]). The learner is an American male aged 20 years, who studied 
Japanese for two years in America and spent four months in Japan. 


Int: Do you think you don’t use them because you don’t understand the meaning or is 
it a meaning you don’t want to express? 

NNS14: Neither, I think. I believe I understand the meaning, and I have been taught the 
meaning and the right time to use it, but it’s not something that I am familiar with 


using, its a concept that...you know...“to do something for another person”... am 


more used to just saying: “he read the book for me”, rather than saying “He read 


the book for me doing me this favour”. 


Int: It’s interesting that you said that, because in textbooks this meaning is 
exaggerated, but in fact they are just grammatical devices. 

NNS14: Perhaps, but I was also taught “watashini” would be another way of saying “for 
me” or “on my behalf”, so because of that I think I hardly use it, and also in 
conversation I found that friends weren’t using them, so perhaps, I was maybe 
misled, but I was given the impression that it wasn’t exactly the same.... 

Int: It’s interesting that in English you said: “they really try hard to explain to me”. So 
obviously you are acknowledging that it was not something that you can take for 
granted, but they were making an effort for you, which is exactly the meaning that 
a verb like kureru would express...but still you never used it in Japanese. So, it’s 
not that you are not aware of... 

NNS14: No, I am very much aware of it, I studied it about a year ago and we studied it in 
intensive language [??], and occasionally IJ also use it in class though I hardly ever 


use them conversation. Iam aware of not using them, but I think...though I 


understand it I maybe don’t know when to use them, or maybe I am not exactly 


Presented by: https://jafrilibrary.com 


Presented by: https://jafrilibrary.com 


Learning to Reframe 139 


comfortable with them. 


Int: [...] Are you aware of your friends using them? 

NNS14: If they used it I think I’d notice it, though I usually don’t think I would think 
anything of it, other than...maybe what you said about the direction, but at the 
same time I don’t think I would respond by using the opposite. 


Int: When you say, “my friend called me”, how do you say that in Japanese? 

NNS14: “tomodachi wa watashi ni denwa shita” or “moratta”. 

Int: But you know that you could use “denwa shite kureta”. 

NNS14: I would use “watashini” because there it translates English more directly, because 


if you say “denwa wo shite kureta,” I as a subject never comes into this, and so if 
you say “watashini” at least the sentence is more structured. 


Int: That’s interesting; so it helps you make the structure clear. 
NNS14: It’s because I am still thinking in English and I haven’t yet...it took me five years 


before I could start thinking in French so, I know that after 2 years of study it’s 
not a point yet where I can think in Japanese. 


In addition to his diffidence toward the complexities of this set of verbs 
and the tailoring of Japanese utterances to fit English conceptual frames, it is 
interesting to note that in his answer to the first question, NNS14 mentions 
the lexical meaning of the verbs. It can be argued that it is the notion of a 
“favor” that is conspicuous upon the introduction of these verbs (often 
because of the way in which they are introduced), and that this notion will be 
considered to be the primary aspect of meaning in all the uses of these verbs. 
However, even in cases in which this benefactive connotation can be 
acknowledged, these verbs continue to retain an element of unfamiliarity; 
this can make learners uncomfortable with actively employing them. I will 
return to the critical issue of the largely literal readings of these verbs later. 

While intermediate learners who are at a slightly more advanced level 
continue to be aware that their performance is conditioned by the demands of 
spontaneous conversation on processing time, they appear to approach a 
threshold at which they begin to “get the feel” of the constructions. 


NNS21’s use of benefactives is largely correct—eight out of twelve uses in the questionnaire 
and five uses in the interview were correct, and there was only one instance of misuse. 
NNS21 is an intermediate-level learner—an American male aged 19, who studied Japanese 
for four years and spent three months in Japan. 


NNS21: I know the difference but I don’t use them, I don’t know how to use them. 


Int: Do you know if and how your friends use them? 
NNS21: [hear a lot of kureru and morau but I don’t hear much of the other ones... 
Int: That is very likely if you only talk with friends. But you used it in Sentence 1 [of 
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the questionnaire: tomodachi ga pa-ti ni kite kurete, ureshikatta], which is 
surprising because you don’t seem to be using them like this [optional uses, BP] 
in conversation [...] 

NNS21: I did think about it. I thought about whether I should use kureru or not and then I 
tried to think of how a Japanese would say it, and so I thought it would be kite 


kuremashita. [...] It_is a matter of having time to think about it [...] One problem 


for me is that I just use intuition, so knowing the rules doesn’t really help, I need 
to get a feeling that it is “right.” 


Int: Would you find it strange if someone said, “Okaasan ga yorokonde kureta”’? 
NNS21: No, I would find it all right. 
Int: But would you use it yourself? 


NNS21: I would probably say, “Okaasan ga totemo ureshikatta.” It doesn’t come naturally 
to use it, but if I hear it I think, “Oh, yeah.” 


The accumulation of linguistic experience is crucial in that it facilitates 
the perception of what “sounds native” (which can be considered evidence of 
a computation of what is frequent and conventional in the input). Further, it 
allows for the adoption of novel and more complex constructions, including 
benefactives, that are divergent from the initial repertoire, thereby admitting 
a new type of “metaphor” (the benefit-transfer metaphor) into this repertoire. 


MDMA: [...] However, after a few weeks, I could understand ageru, morau, kureru, 


itadaku, etc. It wasn't until I went to Japan that I really got the hang of how to use them with 
other verbs [...]. I think it took me about six months or so of being in Japan to be able to use 
these constructions without thinking too hard about them. Before I really understood how to 
form a sentence like “Tomodachi ga kite kurete, ureshikatta,” I would have probably said 
something like “Tomodachi ga kite, ureshikatta.” 


My claim is that the working of the benefactive structure is similar to 
that of the Trojan horse. The metaphor is gradually adopted on the grounds 
that it is used for syntactic and discoursal reasons and hence occurs 
frequently in the input. However, it carries the potential for constructing new 
“frames,” and hence, for evoking new stances and ultimately, constituting 
new identities. The structure opens up new avenues in the sociocognitive 
landscape once it has been integrated into a linguistic repertoire. However, 
and perhaps paradoxically, this is likely to occur at a level at which the 
structures have become formulaic, i.e., at a level at which they are fluently 
manipulated as chunks of “native” behavior with a lower lexical and a 
stronger grammatical valence. 

This “infiltration” of a new metaphor into an individual’s repertoire is 
probably not a “naturally” conscious process; nevertheless, a particular 
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linguistic sensitivity, a solicited introspection, or a professional analytical 
interpretation could presumably bring it to consciousness. However, the 
sociocognitive construction of the frames is impervious to introspection, and 
thus lends itself to the creation of personal narratives that can then be used 
for argumentative and social purposes. 


Advanced learners 

The connotations of my use of the term “Trojan horse” may be further 
illustrated by the words of NNS20, an advanced learner. It is interesting that 
the rather dismissive remarks that follow are those of a person who is not 
only the most prolific user of benefactives in the entire group but also a 
person who uses them rather liberally. Some of the situations she describes 
are clearly benefactive, such as a teacher providing her with a reference letter 
(suisenjo wo kaite kudasaimashita) or her Japanese boyfriend not correcting 
her mistakes despite her expectation that he should do so (naoshite kurenai). 
Some of the other situations described by this learner were less obviously 
benefactive, for example, neighbors not initiating conversations 
(hanashikakete kurenai), other people being able to understand her when she 
spoke in Japanese (wakatte kurereba...), and her becoming interested in 
Japanese because of a teacher who showed the students videos in class 
(bideo misete kuremashita). 


NNS20 is a female learner from New Zealand; she is 23 years old and has studied Japanese 
for nine years. She has lived in Japan for two years. She uses benefactives more frequently 
than any other learner (24 occurrences in the interview), and a majority of these are very 
accurate. 


Int: As far as keigo [honorifics] is concerned, you seem to be shifting from kureru to 
kudasaru at random... 

NNS20: Is not something ...1 mixed them for a long time...all through high school 
basically...they are _a mystery to me...once I got into university and my 


teacher...I had the context so I had to ask for things [...] or asking for various 
things in class...But it’s still the keigo [honorifics], and_as an English speaker, 


sometimes I don’t see the necessity. 


Int: But there is no really polite implication in most of the uses of kureru... 
NNS20: When I am using them I don’t use them so much thinking that these people are 


doing something for me. I think it just makes it [the action] clearer. 


If the learner’s intention in the abovementioned instances was merely to 
“make the action clearer,” she could have used, for example, —te kuru 
constructions, passives, or even the main verb without an auxiliary; however, 
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she did not do so. Her statement regarding not “seeing the necessity” of 
using benefactives (which she considers to be keigo) cannot be considered a 
reflection of her actual frequency of usage (24 occurrences, as opposed to an 
average of 9.5 occurrences among native speakers). 

It may be held that this learner has learned to use benefactives as a kind 
of stencil, i.e., a convenient device that can be used in a variety of novel 
situations. She claims that she uses them only “to make the situation 
clearer”; therefore, she is aware that benefactives do not always carry a 
strong lexical meaning. This indicates that they are used as grammatical 
devices in a manner evocative of the way native speakers construe their use 
of these structures. However, the high number of benefactives used raises the 
question of whether this learner is still capable of making deliberate 
pragmatic choices regarding the use of benefactive and non-benefactive 
framings, or whether this has turned into a case of conditioned 
hypercorrection. I will return to this question after examining two more 
cases. 

The most perplexing characteristic of the metalinguistic judgments of 
the most advanced learners is that they indicate a discomfort with the use of 
benefactive verbs, which is, in fact, not at all reflected in their actual 
performance. These leamers are relatively self-assured and fluent in their use 
of benefactive constructions and, quite surprisingly, they not only match but 
also exceed the frequency with which native speakers use them. This can be 
observed in Table 2. 

Many learners answered the general question on benefactives by 
referring to a moral dimension, thereby suggesting that this dimension may 
indeed be salient in their conceptual schema. 


EG3: To me, these type [sic] of expressions demonstrate an emphasis in Japanese to show 


gratitude to people who do something for you. I suppose it may be related to the concept of 
“giri”—the idea that you owe this person something for their favor. 


JM2: The fact that “to give” or “to receive” is used so commonly for actions (in English, 
for nouns defining tangible things) is maybe another indication of the “girishakai,” i.e., the 


grammar supports the idea of obligation. 


Keigo is often mentioned in relation to benefactives; however, we have 
seen that the honorific meaning is conveyed by only one of the many 
paradigmatic oppositions described above. The learners’ comments regarding 
keigo and its ideological underpinnings were considerably more homogenous 
than their varied geographical and cultural backgrounds would have 
suggested. However, in addition to a few negative comments on the nature of 
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such a moral order (which occur again in the interviews presented below), 
their comments also appear to hint at the mere fatigue resulting from 
constantly paying attention to such social dimensions. 


SG3: I think these verbs are another way of reinforcing the hierarchical system which 


underpins Japanese society. 


DB3: Yet another way of indicating the social status of oneself and other people. It’s a little 


bit positive and negative I suppose. I found it particularly annoying to have to show what I 


thought of my relationship with the other person all the time. I didn’t mind being polite, but 


would have like to been able [sic] to speak neutrally without having to indicate social status. 


Although the mechanics of their linguistic features are becoming 
decreasingly onerous, benefactive constructions can still be construed as 
having a purpose that is, in certain respects, at odds with the learners’ social 
identity. The learners’ comments—even when they are not corroborated by 
data on the learners’ actual performance—indicate that the social system and 
cultural values that underlie or are reflected in these structures are potentially 
misaligned with the learners’ sense of self. Benefactives, therefore, epitomize 
what can be termed a prototypical site of struggle. 

While NNS20’s succinct statement suggests that keigo is of little use to 
English speakers, NNS24, the subsequent learner, provides a much more 
vivid description of such a linguistic ideology. 

This learner is a fluent speaker and produces complex utterances such 
as “heya ga aite iru koto wo oshiete moraenakatta” (I couldn’t get them to 
tell me that there was a spare room), “tomete morainagara heya wo 
sagashite ita” (I looked for a room while being put up there), or 
“omoshiroku, okashiku hanasashite moratta ndakedo” (They kindly let me 
have interesting and funny conversations with them); yet, during 
self-assessment, he apparently stated that his use of these structures is no 
more than an opportunistic stance that is linked to his need for connections. 


NNS24 is an American male aged 27 years, who studied Japanese for four years, spent two 
and a half years in Japan, and whose proficiency level has been assessed as Superior. He is a 
frequent, though not always orthodox, user of benefactives (ten occurrences in the oral 
interview). 


NNS24: I don’t really know how I use these verbs. I learned them in context when I was 
doing my internship and dealing with a lot of different people, and from a book, 
Gone fishin’, that has had a great influence on me. 
[....] I use morau more often. But I don’t know if this is because when you learn 
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one form, you always use that and avoid the others. 

[....] I think that I use morau when it is me having them do something, whereas I 
use kureru when people do things out of their own will. 

[....] I think I have learned them as set phrases. 


[....] 1 don’t only use them naturally or spontaneously, but I need to think about it, 


for example, “[kippu no] sagaku wa Takamatsu kara harawashite moraemasu ka” 
[Can I (lit. Can I be allowed to) pay the difference from Takamatsu station?]. I 
think I thought about it while I was walking up to the conductor, but maybe this is 


a mainly interactional trick [a set phrase?, BP] and has none of the problems of 


kureru and morau on the syntactic level. 


[....] Ldo think of these verbs as something that is not really me. I am much more 


polite in Japanese than I would be in English. If I am dealing with a situation 


where I am interviewing somebody or somebody I just met, where I want to be 


nice to them or I’m kind of depending on them, I am this little student from 


nowhere and he is such a busy person; I want to interview this person and so I am 
trying to be polite, and it is like “if I could be that polite in English” I would be, 
but there is no... you know, there is only so much you can do. 

[...] I think I use itadaku a lot, but not kudasaru—it just seems too much. Itadaku 
is not so terribly polite but Kudasaru is, and I don’t hear it much; I associate it 
with an old woman in a kimono...Jtadaku seems less of a jump from morau than 


kudasaru from kureru. Sashiageru sounds very arrogant. 


Again, we observe the way in which benefactives tend to be seamlessly 
intertwined with keigo, which tends to be perceived as “foreign.” Keigo and 
benefactives are generally regarded as being “difficult,” complicated, and 
unnecessarily pervasive, and are usually perceived as being in conflict with 
one’s real persona and communicative needs (an investigation of the extent 
to which this is the result of teaching methods and the predominance of the 
stereotypical representation of the uses of keigo remains a major topic for 
further research). 


NNS19 is a female American learner aged 31 years; her proficiency level is Superior and she 
has studied Japanese for nine years and lived in Japan for two years. Her oversights are 
merely occasional (one instance of misuse and one of avoidance in the oral interview); 
however, the most striking feature of her speech is her frequent use of a variety of 
benefactives—17 occurrences in the interview, many of which are in non-conventional or 
non-compulsory environments; i.e., they are affective uses such as “[ryuugakusei senta-de] 
yasashiku yatte kureru ndesuga” (kindly did [things] for me [at the foreign student office]) 
or “josei no hou ga kanrinin wo shite kurete imasu” (she acts as caretaker for us). 
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[Commenting on some NS‘ use of kureru to refer to their own students doing the homework]: 
NNS19: [...] the students must do their homework, so...I see it objectively, and I don’t 
think that they should do it for me, but if a Japanese girl said “gakusei ga benkyou 


shite kureru,” I wouldn’t think okashii [odd], but _jibunwa sou iu _waku_ de 


kangaete inai to omoimasu. [I don’t think along those lines.] 


Int: If you were teaching a student who doesn’t want to do his homework, and then, at 
last, does it, would you use it? 

NNS19: I don’t think I would say “yatto shite kureta.” 

Int: Gimu no hou wo yuusen suru ndesu ne. [You would prioritize the “duty.”] 

NNS19: I would say a student “tegami wo kaite kureta” [wrote me a letter] or “miyage wo 


katte kureta” [bought me a present] but not “benkyou shite kureta” [studied for 


me]. 


Int: The previous person said that he noticed them when the teacher in class said 


“yatte moraimashou.” Nande yatte morau koto ni naru no ka tte, nattoku 
dekinakatta kara kigatsuita tte itte itan desu kedomo...[He said, “Why does it 
come to having someone buy something for you? I noticed that because I wasn’t 
convinced.” ] 

NNS19: I never had such an experience of sudden “kigatsuku” [realize, become aware] 


[...] dakara tabun mawari no hito ga tsukatte ita kara....kiga tsuita ka mo 
shiremasen...[ well, maybe I did notice it...because people around me used it.] 


It is interesting that during the interview, NNS19, while distancing 
herself from the way a NS had characterized her own stance toward her 
pupils’ work assignments, produced frames that even a native would not 
invariably adopt. In addition to the two rather idiosyncratic uses mentioned 
previously, this learner also said [jimu no] sasaki san ga ima [apa-to ni] 
sunde iru hito ni renraku wo totte kudasaimasu (Mr. Sasaki from the office 
contacted+BEN the people currently living in the flat) while referring to the 
person in the foreign students’ division whose job is to organize 
accommodation. When asked to comment on the way in which she had 
secured a job in a school, she said, in reference to the career office, “soo iu 
shidoo shite kureru ofisu datte, sokoni ittara, a, kou iu shigoto ga arimasu yo 
[...] de sou iu fuu ni mitsukemashita” (I found it (through) an office which 
gives+BEN this kind of advice, for example, “Try going there, they have this 
job...”). Similarly, while commenting on the teacher’s corrections of her 
Japanese, she does not hesitate to use a benefactive rather than, for instance, 
a more neutral passive—“kokono sensei wa dono toki mo machigaetara sugu 
naoshite kudasaru shi...” [This teacher would correct me immediately as 
soon as I made a mistake]. Again, it is necessary to reflect on whether these 
learners are merely engaging in hypercorrection. 

Despite the association of benefactives with keigo and the relatively 
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negative evaluation of the latter in the learners’ comments, it is certainly 
curious that the average use of benefactives per subject among the advanced 
learners is considerably higher than that among native speakers (13.7 among 
the non-native speakers vs. 9.5 among the native speakers). Many 
factors—the small size of the sample in particular—may have contributed to 
this result. Although the interviews were of the same duration and the topics 
were generally similar, natural conversation is, by definition, an idiosyncratic 
joint venture rather than an enterprise that can be precisely or unilaterally 
planned. It is, however, possible to speculate on other explanations for this 
result. If the “framing” of events in terms of the giving/receiving of benefit is, 
in fact, a linguaculturally-specific feature as is claimed in this paper, then it 
is possible to hypothesize that the framing may be perceived as a “marked,” 
and hence, salient feature. In contrast to second language acquisition 
research that is conducted within the Universal Grammar paradigm—in 
which markedness has been invoked as an explanatory factor to account for 
the degrees of learning difficulty (McLaughlin 1987: 84)—research on 
second culture acquisition has suggested that markedness is a feature that 
determines learnability (Lantolf 1999: 38%). It has been observed that 
“almost all of the most successful learners [at the Foreign Service Institute, 
BP] demonstrate substantial flexibility and a willingness to shift their 
cognitive set, many of them even to the point that they, in fact, appear to 
develop a kind of target-language subpersonality when speaking the target 
language” (Ehrman 1993). It is possible that this “target-language 
subpersonality” may be organized in terms of an emphasis on what is 
perceived as the defining, singular, and stereotypical norm of a culture—a 
phenomenon of the “exaggeration” of a stereotype, similar to that which 
determines hypercorrection. This would account for the attested 
hyper-nativization of the behavior of advanced learners and much of the 
anecdotal evidence on second culture acquisition. 

However, while this explanation can certainly account for some of the 
apparent contradictions that we have observed, it is necessary to take note of 
other facts as well. Regardless of the presence or absence of an event of 


*4 T antolf quotes a study conducted by Yoshida, K. (1990), “Knowing vs. behaving vs. 
feeling: Studies in Japanese bilinguals,” in L. Arena, (ed.), Language Proficiency: 
Defining, Teaching, and Testing, New York: Plenum and refers to Yoshida’s cross-cultural 
word association tests, in which Japanese-English bilinguals behave like monolinguals in 
culture-specific terms. The bilinguals behave like Japanese when associating with Japan 
and like Americans when associating with Christmas. “This leads one to suspect that 
when a concept is unique to the C[ulture]2 and perhaps known only at a conscious 
intellectual level in the C1, C2 learners may learn the concept, at least at the metalevel, 
because it is marked relative to culture.” 
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sudden realization (which, in any case, presumes the existence of a “building 
up”), NNS19’s statement regarding the appropriateness of benefactive use 
for the purpose of defining teacher-pupil role-based expectations suggests 
that this learner can adopt a principled behavior, which she is capable of 
bringing to consciousness and elaborating on to a certain degree. In the 
excerpt, she comments on a native speaker’s use of a benefactive; however, 
while comparing her own norm to that of the native speaker, she claims the 
right to a distinctive identity and the right to use Japanese in the manner that 
she wishes. According to her, a teacher should not be personally concerned 
about students’ homework. The affective (as well as social) dimension that 
she can conceive of between herself, as a teacher, and her students is one that 
appears consistent with the way in which she could, perhaps, have coded this 
dimension in English. However, she is content to adopt a more 
“interconnected” stance in other roles (tenant vs. caretaker, etc.), connoting 
an “intersubjective” understanding of relationships that is possible because 
she creatively exploits this linguistic resource. This learner may have 
adopted another (hypercorrected) subpersonality; however, she demonstrates 
that she is not entirely ruled by it. The Japanese language introduces her to 
frames that she did not possess when she spoke only in English; yet, it does 
not force her into these frames completely. She can choose not to adopt 
frames that present a problem to her. 

Nevertheless, it is necessary to account for the apparent inconsistencies 
between the metalinguistic judgments and the actual performance of the 
learners. These interviews confirm the well-known fact that learners, like 
native speakers, are often inaccurate while providing self-reports of their 
actual performance. It is interesting to note that learners typically refer to 
“misalignment” even in cases in which the structures are used extensively 
and fluently; self-assessments appear to “disown” what is considered to be a 
“typically Japanese” feature. The direction of this misrepresentation is 
therefore the opposite of one that is much more widely attested, i.e., the 
claim of following normative and canonical rules while being deviant, 
vernacular, etc., in actual production. The nature of such metarepresentations, 
however, is the same—an ideologically motivated narrative about the self, 
which is not necessarily contradictory. If one considers the actual usage of 
benefactives during the interview in Japanese and the metalinguistic 
judgments about these benefactives during the post-interview session in 
English as two separate and distinct activities that have separate, distinct 
aims and belong to separate, distinct discourses, this inconsistency ceases to 
be problematic. While the interview (conducted entirely in Japanese) 
instantiates the “subpersonality” that adopts target norms (or any subjective 
reelaboration of them, which could exceed native norms), the post-interview 
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session (conducted in English) is an instantiation of the “subpersonality” that 
can depart from those norms. It is necessary to be aware of the risks involved 
in taking the research participants “at their word” (Block, 2000: 758) and of 
the many dimensions of the interview as a social context. Consequently, the 
interview data provides “representations of real events” as well as 
“presentations of the individual speaking” (ibid.). According to this 
characterization, the learners’ interviews and post-interview comments must 
be conceived of as_ activity-driven behavior and metacognitive 
conceptualizations of that behavior, respectively, serving different “politic” 
aims that are defined by the setting and the “dominant” language used at the 
time. In other words, it is possible to understand the learners’ display of 
“discomfort” toward the use of benefactives as a means of claiming an 
identity distinct from that of the native speakers of the target language; this is 
unsurprising, if not predictable, in situations in which “nativeness” is no 
longer relevant. 

The qualitative difference in the case of advanced second language 
users is not only that they use benefactives fluently and without hesitation 
and are confident with idiomatic uses but also that they can extend this use to 
non-idiomatic and novel contexts, thereby demonstrating undisputed creative 
skill. 


Some conclusive observations 

In this paper, we have examined the way in which learners of Japanese 
decipher, adopt, and ultimately adapt to one of the expressive features of the 
language that is likely to be particularly demanding due to the multiple 
functions it performs. The profiles of different proficiency groups outline 
differences in frequency of use and accuracy and a close reading of 
individual interviews reveals differences in fluency. A feature that is 
common to all the groups is a general disassociation from the social values 
of hierarchical subordination or old-fashioned mannerisms that (in the 
subjective or argumentative construal of such values by most learners) 
allegedly govern the use of benefactives, and a distancing from the use of 
benefactives as honorifics. It has been argued that the gap between the 
learners’ actual performance and the constructions presented in their 
metalinguistic comments is evident in the different discourses underlying the 
two activities, i.e., the ethnographic interview conducted in Japanese and the 
reflexive post-interview session conducted in English. 

I have argued that this competence (evidenced by a fluent, if not always 
canonical use) is achieved in a subliminal manner. Benefactives are adopted 
because they occur frequently in the environment and have a grammatical 
basis. Each instance of the use of these verbs aids the establishment of a 
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cognitive possibility that did not previously exist, namely, that of framing 
events as facts whose intersubjective effects are as salient as the nature of the 
events they evoke. In my opinion, the habit of codifying the (alleged) 
existence of such effects is developed through extensive exposure, through 
repeated imitation, participation in social networks, and meaningful 
interaction. This process is subliminal because, on account of the social 
ideologies that it potentially competes with when it does not clash with them, 
it is rarely acknowledged openly. 

The “cognitive restructuring” (Danesi 1993, quoted in Lantolf 1999: 43) 
is probably driven by a progressively increased familiarity with situated 
structures. It explodes in a burst of creativity once two conditions are 
fulfilled, i.e., once the “processing” constraints are overcome and once the 
learners are no longer inhibited by an analytical reading of the auxiliaries 
and an overreliance on the literal notion of benefit transfer. It is only when 
benefactive constructions become routinized and grammaticalized that a 
truly pragmatic (i.e., strategic and subjective) meaning is achieved. Thus, 
creativity can be considered to emerge from conventionality and 
routinization. The hypercorrective nature of the attested frequent use does 
not appear to preclude a principled avoidance (or a mere non-use) of 
benefactives in specific, and perhaps potentially controversial, cases. 

Our discussion has highlighted the manner in which an excessively 
close lexical reading of these constructions may become an interpretative 
“trap”; learners must first understand that the conceptual schema evoked by 
notions of “favor” is one whose structure is rather different. 


MR3: There are two aspects to it, the first one being a demonstration of respect and thanks, 


since this form implies either a request or a favor that has been received. However, as in 


“uchi ni kite kureta toki” [when he came+BEN to my place...], we can observe that the 


conception of “a favor” goes beyond that of English. The second aspect is that it clarifies 


who is doing what and for whom, which is particularly useful in Japanese due to subject 


omission; this might explain the use of these forms, rather than merely politeness. 


The introduction of auxiliaries after the use of benefactives as main 
verbs is a pedagogical convention; this is because of rather obvious reasons 
that are related to structural complexity and linear progression. However, the 
literal interpretation of the verbs—whose deferential symbolism is quite 
transparent—compounded with the availability of honorific options and 
consequently, their association with politeness-related meanings, intensifies 
the challenge posed by these structures and renders them items that the 
learners apparently struggle with for a long time. It is perhaps unwarranted to 
use these speculative observations to draw strong conclusions regarding the 
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direction of classroom practices; however, some general guidelines like the 
following could be suggested. The distinction between meanings derived 
from honorific vs. non-honorific contrasts (e.g., kureru vs. kudasaru) and 
those that do not involve honorific distinctions should be clearly outlined in 
order to prevent stereotypical representations. The relationship of this set of 
verbs with the other structures in paradigmatic oppositions should be 
explored. It is necessary for teachers to be aware that these verbs activate 
conceptual frames that carry strong ideological implications; thus, a 
metaphorical reading should be encouraged, while an emphasis on the literal 
meaning, which is no longer immediately accessible even to native speakers, 
should be avoided. 


EG3: I personally like these forms—I like the way they can subtly show gratitude without 
getting too soppy about it. The English translations which were used to help us understand 


Japanese always sounded a little over the top. 


DK3: I find that processing them through the idiomatic use of L1 works. “-te kureru” is 
taught as “someone gave to me by (doing) -te.” However, if it is removed from its literal 


translation and treated with an idiomatic interpretation such as “somebody* did —te” 


(which was nice of them*/benefited me), the mental gymnastics required to process who is 
doing what is lessened. 


Since I was able to find a sufficient number of proficient speakers of 
Japanese who employed these constructions fluently despite their allegations 
of avoidance and misalignment, I am inclined to believe that a negative 
evaluation certainly does not prevent acquisition. The extent to which it may 
handicap or delay this acquisition cannot be determined using the data 
presented in this study; however, this is an issue that warrants consideration. 

I have characterized “giving/receiving” verbs as indices of frames of 
interconnectedness, and have suggested ways in which cognitive 
restructuring and acculturation may occur. I have attempted to provide an 
“objective” (or at least “detached”) conceptualization in accordance with the 
conventions of scholarly enquiries. Nevertheless, my “story” is not more real 
than those of the users in situated contexts; it merely pursues different goals. 
In an attempt to answer the question of “what these verbs reveal about the 
‘Japanese’ way of speaking,” a remarkably talented, advanced learner 
describes her “story” as follows: 


JHMA: I feel slightly hesitant in answering this and don’t want to generalize too much. 


However, it does seem to be the case that there is never an entirely equal relationship in 
Japanese conversations. It is pretty much impossible to speak naturally without thinking 
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about how you and your interlocutor relate and what you can do for each other. This might 
seem negative, but actually, the idea of favors criss-crossing back and forth eternally seems 
like quite an active concept of interaction to me. 

I think that you HAVE to think about these things. But I think that being forced to do this can 
be very positive and feel liberating. 
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The Zone of Proximal Development and 
Second Language Acquisition 
— Beyond Social Interaction — 


Amy Snyder OHTA 


The Zone of Proximal Development (ZPD) was originally intended to 
assess the educative potential of children. Increasingly, it has been applied 
to understand how adults increase expertise in second and foreign languages 
(hereafter L2'). Social interaction—collaboration between the child and an 
adult or more capable peer—is built into Vygotsky’s definition of the ZPD. 
Social interaction has also been the focus of applications of the ZPD to adult 
language development*. However, adults differ from children in a variety 
of ways. Unlike preliterate children, adults can gain assistance not only via 
social interaction, but also through a variety of other sources including 
reference books, on-line helps, or video recordings. By engaging with 
these materials, adults meet challenges with assistance and language 
acquisition occurs. Building on these ideas, this chapter proposes that the 
ZPD is more than a between people space. Rather, I propose that the ZPD 
itself is internalized through the process of cognitive development, so that 
adults are able to create and manage their own ZPDs, not only in interaction 
with people but also in interaction with books and other media. 

In terms of structure, the chapter first introduces the ZPD, does 
definitional work, and considers how the ZPD has been applied to L2 
research. The relationship between the ZPD and the concept of regulation 
is explored. Then, the limits of the ZPD, defined as a social interactive 
space, are considered with reference to an earlier study of highly proficient 
speakers of L2 Japanese. Problems with considering both the ZPD and 
language development as strictly social-interactive (between people) 
processes are raised. A reconceptualization of the ZPD is then proposed as 
a solution to these problems. It is reconceptualized as a unit that initially, in 
infancy and early childhood, exists only in social interactive contexts. As 
the child develops, the ZPD is gradually generalized as it is internalized such 


1 ' . ae 
While “L2” is traditionally read as “second language,” here the term refers to second 


languages, foreign languages, and subsequent languages, for multilingual individuals. 
The terms “development,” “learning,” and “acquisition” are all used in this chapter to talk 
about how people increase in L2 facility. 


2 
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that learners can form ZPDs both between themselves and others and when 
interacting with texts and other media. Finally, these ideas are tested via an 
analysis of interviews with five advanced L2 speaker/readers of Asian 
languages (Japanese, Chinese, Korean and Sanskrit). In the discussion of 
interview findings, I conclude that the concept of the ZPD can be applied to 
a broader range of learning settings for adults than children, because adults, 
unlike young children, have internalized the ZPD itself. 


1. Development and Definition of the ZPD 

Vygotsky wrote about the Zone of proximal development (ZPD) in the 
1930s, in response to reliance on IQ testing for educational decision-making 
in the Soviet Union. IQ scores had been found to be inadequate to predict 
the developmental potential of children, particularly those with disabilities. 
Vygotsky was concerned about why this was so, and worked to understand 
why young children did not perform in school as was expected according to 
their IQ scores (van der Veer and Valsiner, 1991). While it might be 
expected that children with higher IQs would gain more from school, 
Vygotsky and his colleagues found that those with lower IQs gained more 
points as a result of schooling. In addition, students with identical scores 
did not necessarily experience the same gains. Vygotsky posited that 
because the IQ score only showed what the child could do independently, it 
did not reveal developmental potential. Rather, a better indicator of a 
child’s potential development was what that child could be helped to achieve 
with prompting or other guidance. The more that a child could do with 
some assistance, the larger the ZPD and the greater the potential for future 
growth. Vygotsky felt that differences in ZPDs in children with the same 
IQ score could explain why some children surpassed others, and why some 
children who initially appeared to have lower potential as judged by test 
scores seemed to gain more from school than others who had higher IQs. 

Vygotsky also observed that there was a ceiling on what a child could 
be helped to accomplish—that children could only imitate what was within 
their capacity to acquire the ability to do themselves. In other words, 
collaboration works as a powerful tool in stimulating mental development to 
the extent that the child is able to productively participate in the collaboration. 
When something becomes too difficult, however, collaboration cannot be 
used to push the child further. Vygotsky stated that: 


In collaboration, the child solves problems that are proximal to his level of 
development with relative ease. Further on, however, the difficulty grows. 
Ultimately problems become too difficult to resolve even in collaboration. The 
child’s potential for moving from what he can do to what he can do only in 
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collaboration is the most sensitive index of the dynamics of development and the 
degree of success that will come to characterize the child’s mental activity (Vygotsky 
1987, p. 210). 


According to Vygotsky, the ZPD is “the distance between the actual 
developmental level as determined by independent problem solving and the 
level of potential development as determined through problem solving under 
adult guidance or in collaboration with more capable peers” (Vygotsky, 1978, 
p. 86). Challenges that are beyond the child’s independent ability but 
which can be achieved in collaboration with another push the child’s 
development forward. In collaboration with another, what was impossible 
becomes possible—the child can achieve what s/he could not accomplish 
alone and this achievement works to advance his or her mental functioning. 
In its power to conceptualize the role of collaboration in the child’s mental 
development, the ZPD has come to be used less as an assessment tool and 
more in order to understand how learning occurs. The concept of the ZPD 
has helped to increase appreciation of how dialogue, collaboration, and 
other forms of face-to-face interaction function to stimulate cognitive 
development. 

Application of the ZPD has moved beyond the arena of child 
development: Today the ZPD is used by those interested in adult 
development as well (John-Steiner, 1985), and has drawn the attention of 
second language acquisition researchers. For those interested in promoting 
learning, an important principle, broader than the initial formulation of the 
ZPD, has been applied in a variety of settings. That is, if what a person can 
do with assistance, he or she will soon be able to do without assistance, then 
providing opportunities for learners to be assisted with challenging tasks 
should aid development. Researchers interested in second language 
acquisition have begun to use the ZPD to consider a variety of research 
questions. Donato (1994), Brooks & Donato (1994), Ohta (1995, 2001, 
2003), and Lantolf and Aljaafreh (1995), among others, have looked at the 
mechanisms of the ZPD—how it is that zones of proximal development 
emerge in classroom and tutorial interaction. It is also interesting to 
consider how true peers are able to work together to handle L2 challenges. 
Beyond the pooling of expertise (Donato, 1994) in which learners contribute 
differential knowledge to the pair or group, Ohta (2001) also found that 
projection processes and shifting participant roles allowed true peers to assist 
one another to accomplish what neither could do without collaboration. 
Researchers have also applied the ZPD to investigation of classroom 
grammar development (Ohta, 2000; Swain, 2000), developing writing skills 
(Guerrero & Villamil, 2000), and interlanguage pragmatics (Ohta, 1997, in 


Presented by: https://jafrilibrary.com 


Presented by: https://jafrilibrary.com 


158 Amy Snyder OHTA 


press). 

Ohta (1995, 2001) has rephrased Vygotsky’s definition of the ZPD to 
better suit the language learning context. In this tailored definition, the 
ZPD is “the distance between the actual developmental level as determined 
by individual linguistic production, and the level of potential development as 
determined through language produced collaboratively with a teacher or 
peer” (Ohta, 2001, p. 9). This definition was developed in response to data 
that found adult learners jointly produced utterances that neither learner 
could manage alone. This joint production becomes a part of each learner’s 
interactional experience, helping them to move toward future independent 
production. The ZPD is helpful in understanding how collaborative 
language use builds into individual language learning”. 


2. The ZPD, internalization, and regulation 

Vygotsky’s general genetic law of cultural development provides 
another metaphor—that of social and _ psychological planes—to 
conceptualize how development occurs: 


Any function in the child’s cultural development appears twice, or on two planes. 
First it appears on the social plane, and then on the psychological plane. First it 
appears between people as an interpsychological category, and then within the child as 
an intrapsychological category. This is equally true with regard to voluntary 
attention, logical memory, the formation of concepts, and the development of 
volition.... [I]t goes without saying that internalization transforms the process itself 
and changes its structure and functions. Social relations or relations among people 
genetically underlie all higher functions and their relationships. (1981, p. 163) 


Vygotsky’s claim is not just that learning occurs first in a social space before 
it is internalized by the learner. Vygotsky sees all cognitive development as 
grounded in social interaction that preceded and formed it; children’s 
attention, memory, self-control, and reasoning all develop through social 
interaction. 

The ZPD is sometimes related to the concept of regulation (Wertsch, 
1985). Regulation refers to how a person is able to manage and direct their 
cognitive processes. As children develop, they move from being object- 
regulated, to other-regulated, to self-regulated. The object-regulated child 
moves and acts in response to objects in the environment. For example, if a 
child sees an interesting toy, recognition of that object may prompt the child 


3 Some writers have misunderstood the ZPD as being analogous to Krashen’si+1. See 
Dunn & Lantolf (1998) and Kinginger (2001) for useful definitional work. 
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to go to pick it up. For the child who is object-regulated but not yet 
other-regulated, direction by an adult to do something different is unlikely to 
be effective. It wouldn’t be that the child was ignoring the adult—rather, 
the object-regulated child may not yet be able to respond to the intervention 
of another. If asked to get a ball, but on the way to the ball there was some 
other interesting toy, the object-regulated child might pick up that toy instead, 
even if prompted by the adult to continue toward the ball. Through 
interaction, the object-regulated child can learn to change his behavior in 
response to another. The adult may engage with the child and work to 
focus the child’s attention on the ball through words, gestures, imitation, or 
by going to the ball himself. Joint attention may help the child to do what 
couldn’t be done without assistance—to get the ball. Joint attention allows 
the child to focus and meet a challenge. Through this experience, the child 
is other-regulated. Other-regulation means that the child is able to follow 
the directions of others in performing tasks. Over time and interactive 
experience, the child learns to provide prompts to him or herself, and no 
longer needs the adult to assist in focusing attention. At this point, the child 
behaving in a self-regulated manner. Even if distracted on the way to the 
ball, the self-regulated child can refocus attention and continue to the ball 
without additional prompting. 

The ZPD has been related to the concept of regulation in that it is the 
assistance of another that pushes development forward (Lantolf 1984, 
Lantolf and Aljaafreh 1995), in the examples above, the assistance of the 
adult that prompts continued focus on the goal. As stated by Lantolf (1994), 
“the transition from other-regulation, or intermental activity, to self-regulation, 
or intramental activity, takes place ... in the zone of proximal development 
where child and adult engage in the dialogic process” (p. 12). Here, Lantolf 
equates other regulation with intermental activity. Intermental activity is 
the activity that occurs between people—collaboration. As the child 
interacts with the adult in the ZPD, his or her world is gradually re-shaped 
through the development of intersubjectivity—the ability of the adult to 
tailor interaction to the child’s, and the child to adapt to the adult’s view of 
the problem at hand. 

For adults as well, assistance provided in the ZPD can be viewed as 
other-regulation. Unable to continue on one’s own, the individual must 
seek the assistance of another. Incorporating this assistance allows the 
individual to return to self-regulation. In this view, the seeking of 
assistance is a regression to an earlier stage of development. This is 
allowed theoretically by the concept of continuous access—that is no matter 
how highly developed an individual is, earlier developmental stages are 
available to draw upon as resources. When the adult encounters a task that 
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is highly challenging, that person may revert to other-regulation, 
incorporating the assistance of another to solve the problem. After learning 
through another’s assistance, the adult may be able to regain self-regulation 
and accomplish the task without assistance in the future. 


3. Probing the Limits of the ZPD 

In considering the role of the ZPD and intermental spaces in foreign 
language development researchers have focused on social interaction—how 
language learners interact with each other or with more proficient speakers, 
and how that impacts their progress. Of course, interaction with others is 
critical to language development. Learners have such opportunities in 
foreign language classes. Sojourners abroad have plentiful opportunities to 
develop via interactions with local speakers of the language. 

However, language development is not limited to what happens in 
social interactive contexts. For adult and other literate L2 learners, social 
interactive opportunities are generally interwoven with periods of self-study. 
And, while the ZPD has been considered strictly as a social space, it may be 
useful to apply the construct to language development in other modalities as 
well. Ohta (2001) noted: 


For children leaming their first language[s], social interaction is historically the 
earliest locus of internalization. L2 learners, adults who already have an internalized 
language, are able to use their already developed cognitive skills (including literacy 
skills) to access the L2 outside of the social interactive setting through, for example, 
L2 reading or listening opportunities (p. 11). 


Learner interaction with linguistic materials such as written texts and videos 
may also result in language development in the ZPD. The five highly 
fluent and literate learners of Japanese whom Ohta (1990) interviewed all 
reported studying hard, seeing their experiences studying the language alone, 
involved with their texts, tapes, flash cards, and videos, as critical to their 
language development. The concept of “picking up” a language—of 
leaning a language without studying, just by using it in social 
situations—did not resonate with the five successful learners. While they 
sought interaction with Japanese people, they also reported working hard to 
study Japanese. All completed academic programs of study in Japanese 
and excelled in their Japanese language courses. When asked how they 
developed their high level of proficiency the participants said, 


e “you have to be aggressive and study it” 
e “being dedicated with your homework—there’s no substitute for that” 
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e “7 don’t think I'm a linguistic genius . . . it was just hard work” (Ohta, 1990, p. 52) 


Ohta (1990) did not focus on study methods, so further details concerning 
what the hard work entailed were not sought. All of the participants also 
spent two or more years in Japan and applied their energy and effort to 
placing themselves in interactive contexts with Japanese people. For these 
learners both interaction with people and spending time studying were 
important. It is not clear what the role of the ZPD might have been in these 
“study alone” contexts. However, the five learners were unanimous about 
the importance of such work in their development of high levels of 
proficiency; the use of focused periods of study in addition to and as an 
augmentation of interpersonal experiences built their L2 skills. 

Ohta (2001), in a longitudinal study of Japanese language learners, 
followed college students at a U.S. university through a year of Japanese 
language instruction, analyzing their classroom interaction. This study did 
not consider learner efforts to learn Japanese that took place outside of class. 
However, the learners had homework to do, and had to study as a part of 
their courses. Any teacher will report that students who study experience a 
more rapid rate of language development. However, the mechanisms 
involved are not clear. Ohta (2001) did not investigate what the impact of 
studying outside of class had on the social interaction learners engaged in 
during class. 

“Studying,” however, does seem to be linked with social interactive 
performance, as language teachers will attest. To illustrate with an 
anecdote, while collecting data for another study, I attended and recorded a 
volunteer in a third-year language class. Later, I ran into her in the hallway. 
“Oh!” she said, “I’d forgotten you were coming today!” I said, “That’s 
okay!” She responded, “No, it’s not—I always study harder the night 
before you come to record me. Since I forgot, I really blew it today.” 
Why would she study harder when I was coming? To maximize her 
chances of getting a strong enough grasp of the material that she’d be able to 
mentally access new lexical items and forms during social interactive tasks 
in her Japanese class. Her studying was designed to optimize her classroom 
performance. Learning strategies research also amply documents the 
individual work that adult learners do as they work to optimize their 
language development (Oxford, 1992). In addition, studies of foreign 
language reading document the interactive processes involved in reading and 
how accessing texts functions to advance linguistic skills (Carrell, Devine & 
Eskey, 1988). 

While the importance of interacting with texts in the L2 development of 
literate adults may seem to be obvious from the experience of those for 
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whom time spent studying has been a critical part of L2 developmental 
processes, earlier researchers have dismissed the impact of studying on 
second language acquisition. For example, researchers have claimed that 
“learning” cannot become “acquisition” (Krashen & Terrell, 1982). In this 
chapter, I take a position antithetical to Krashen’s. That is, the various types 
of interaction the adult learner participates in—whether with people in social 
settings or with texts when alone at a study desk—are all spaces where 
language development occurs in the ZPD. Social interaction and individual 
study sessions are interdependent in terms of their contribution to L2 
development. 

This position rejects any dichotomy between what have been called 
“learning” and “acquisition.” In Vygotskian thought, cognitive processes 
are considered to be part and parcel of each other, interwoven into the same 
fabric. Separating language development into learning and acquisition 
creates an artificial distinction—elements which cannot be rejoined to create 
the whole of language development. Vygotsky asserted that analysis of the 
elements of psychological processes, no matter how well each element is 
understood, cannot result in a deepened understanding of the whole process. 
Analyzing a process based upon the use of elements, 


begins with the decomposition of the complex mental whole into its elements. This 
mode of analysis can be compared with a chemical of analysis of water in which water 
is decomposed into hydrogen and oxygen. The essential feature of this form of 
analysis is that its products are of a different nature than the whole from which they 
were derived. The elements lack the characteristics inherent in the whole and they 
possess properties that it did not possess. ... Hydrogen burns and oxygen sustains 
combustion (Vygotsky 1987, p. 45). 


As shown in this example, to understand the properties of water, analysis of 
the elements hydrogen and oxygen is not useful. Rather, a sample of the 
water itself (what Vygotsky terms a “unit”) must be studied. A “unit” 
contains all characteristics of the whole being studied. To apply Vygotsky’s 
idea about units of analysis to L2 development, I propose the ZPD to be an 
appropriate unit that contains the whole within itself. The ZPD initially 
appears only in social interactive contexts. As development proceeds the 
process of the ZPD itself is gradually internalized. The ZPD can, therefore, 
be understood as a smaller unit of the whole, which is linguistic/cognitive 
development. The ZPD is a process that works to compose linguistic/ 
cognitive development: It is contained by the whole but also works to form 
it. 

Emerging initially in interaction between a child and the adults who 
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interactively structure the child’s social world, the ZPD is generalized by the 
child over time as she learns by interacting with adults and peers in a variety 
of interactive settings. While the ZPD is primarily managed by adults in 
interaction with small children, as children develop they are able more and 
more to manage their own ZPDs, working to seek assistance and to manage 
the level of assistance provided such that they can meet the challenges they 
encounter. This growing ability to manage the ZPD is, I propose, a mark of 
how the ZPD itself is slowly being internalized. As literacy is established, 
the ZPD is further internalized so that children become able to carve out 
developmental spaces, as they interact with written materials, initially with 
the assistance of another, working more and more independently over time. 
Internalization involves a gradual increase in the individual’s skill in 
managing their own ZPDs and creating ZPD-rich settings for themselves. 
Students who are successful academically are those who are able to do this 
well. The highly literate adult has not only learned to read and write, but 
also to use written sources to create zones of proximal development. I 
propose, therefore, that the ZPD is a unit of language development across the 
lifespan. It begins as something that, in itself, is not part of the human 
organism, but must be internalized via the social interaction processes 
through which it is formed. In the end, the ZPD itself becomes a tool used 
by cognitively mature adults as they go through life seeking new skills and 
new knowledge, including knowledge of and skill in foreign languages. 


4. A Study of Five Learners of Asian Languages 

Empirical evidence is needed to test this proposal that the ZPD is, in 
and of itself, subject to internalization, thus functioning for adults in all 
linguistic modalities. Therefore, an interview study was designed to better 
understand how the ZPD functions for adults in social interaction and 
interaction with texts and other media. 


4.1 Participants 

An advertisement soliciting subjects for the interview study was 
circulated on electronic list-serves. Asian languages were selected as a 
focus of the study because of the difficulties their writing systems pose to 
native English-speaking learners. Five advanced and highly proficient 
foreign language learner/users agreed to participate in the study. Learner 
pseudonyms and languages are listed in Table 1: 
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Table 1. The study participants 


Pseudonym Advanced Highly proficient Profession 
Niles Korean Japanese graduate student 
Mark Japanese undergraduate 
Evelyn Japanese, Chinese undergraduate 
Sanjay Sanskrit graduate student 
Victor Chinese professor 


The distinction between “advanced” and “highly proficient” was made 
based upon facts gleaned from the interviews, not from formal assessment of 
proficiency. “Advanced skills” refers to languages studied in advanced 
university coursework (fourth year or higher), but not used with professional 
facility. Not all participants with advanced skills were currently enrolled in a 
language class, but all were of a level to benefit from advanced college 
coursework in their languages. All of the advanced learners except Mark 
seemed to have enough facility with their languages to read academic 
material with dictionaries. In terms of speaking, all except Sanjay, who 
studies Sanskrit, appeared to be comfortable using their languages for 
everyday communicative purposes. “Highly proficient skills” refers to 
professional use of the language (for example, using the language of 
specialization to do professional translations, read professional or academic 
literature, write articles, or give lectures). They are able to read high level 
materials in their fields. In terms of speaking, both appeared to be 
comfortable using their languages for sophisticated communication 
requirements requiring a highly developed vocabulary. These individuals 
differed from those described as having advanced proficiency in that neither 
consider himself to be studying the language of specialization. Rather they 
saw themselves as learning their languages of specialization via contexts of 
use. Victor, for example, said “I don’t actively study, but I still encounter 
situations all the time where I’m learning something” (V, 12-14). Therefore, 
in this chapter, the word “learner” will be used to refer to all participants, 
regardless of their language proficiency. 


4.2 Data Collection 

This study utilizes in-depth interviewing as its main research tool. 
Learners were asked a range of questions about their language learning 
strategies, study methods, and help-seeking behavior. These questions are 
listed in the appendix. Questions focused on study and help-seeking 
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methods and how these may have changed as the learner developed greater 
facility with the language of specialization. Part way through each 
interview, the idea of the zone of proximal development was introduced to 
allow learners the opportunity to apply the ZPD in talking about their 
language learning and use experiences. After initial analysis of interview 
data, each participant was then sent a survey listing all of the themes 
mentioned by all participants, with the opportunity to note any areas missed 
in the interview which the individual felt were important in his or her own 
language learning experience. These methods are described in further 
detail below. 


4.3 Analysis 

Interviews were audio recorded and transcribed for analysis. 
Transcripts were analyzed qualitatively through analysis of emergent themes. 
One theme common to the interviews was that of assistance. Another was 
management of the language learning process. Details of assistance-related 
themes are listed in Table 2. 


Table 2. Theme: Sources of assistance 


Theme Examples or details 
Gaining assistance Trying out grammar or vocabulary to see if they were correct, 
from other people trying to hear examples of usage, asking questions about usage, 


how to phrase something, word meaning, or tones, talking with a 
conversation partner, bringing collected questions to a 
conversation partner, writing/editing assistance, assistance with 
classical language interpretation/reading, and reading together 
with a professor. 


Gaining assistance Extra-textual sources mentioned are on-line, electronic and bound 
from textual sources _ dictionaries, Google, self-created databases or dictionaries, written 
texts as a resource for writing, translations, and literary criticism. 


Preferred sources of Differing preferences were expressed, whether learning from a 
assistance language expert, social interaction, dictionaries, other textual 
sources, or a combination of these. 


As summarized in Table 2, participants had a wide range of resources 
and methods of seeking assistance, as well as variable preferences for the 
type of assistance they found particularly useful. In terms of managing the 
language-learning process, they mentioned the strategies and study methods 
shown in Table 3: 
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Table 3. Theme: Managing Language Learning 


Theme Examples or details 

Choosing to learn, Choosing to learn or ignore particular vocabulary or characters. 

ignore, or avoid Examples: Choosing to avoid transitive verbs (Japanese), 
4-character idioms (Chinese), and writing overly complex 
sentences 

Study methods Dictionary reading/analysis/copying, talking with conversation 


partners, seeking out conversation opportunities, reading, 
annotating texts, marking tones, hand-copying, translation, 
watching TV, studying grammar rules, reading aloud, repetition, 
making and/or studying flashcards, making and/or studying 
vocabulary lists, and repeated writing of characters or vocabulary. 


After these themes were delineated, a Microsoft Excel table was 
compiled for each participant showing the all of the themes mentioned in the 
interviews, marked with “yes” where a particular item was mentioned by 
each interviewee. Individualized tables were then sent to each participant. 
They were asked to read over what was shown, and then note what methods 
of gaining assistance, assistance preferences, or methods of managing the 
language learning tasks they might use, if any, in addition to what they 
mentioned in the interview. In this way, participants were able to fill in 
anything they might have not discussed in the interview but which was 
relevant to their own assistance-seeking and language learning experience. 
All five participants returned the tables with additions and comments noted. 


5. Results 

Each of the five interviews provided evidence of the learner’s use of 
ZPDs in their language development. Each learner will be discussed 
separately, with further discussion of the group in the discussion section. 


5.1 Niles: Japanese, Highly Proficient 

Niles is a graduate student who arrived at graduate school already 
highly proficient in Japanese from residing and studying in Japan. He 
began working as a professional translator prior to graduate school. 
Currently, he works as a translator and teacher of Japanese. 

In the interview Niles spoke a great deal about tailoring his efforts to 
learn Japanese around what was developmentally accessible or of interest to 
him. For example, he didn’t bother trying to learn Japanese pitch accents 
until recently, because he couldn’t hear them. “The reason I wasn’t 
interested in doing it was because I just couldn’t get it’—even with 
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assistance, pitch accents were beyond him (Niles, 30-31*). Now that he can 
hear them he is working to improve in this area. This is an example of how 
Niles used the ZPD—first, what was outside of his ZPD—to organize his 
language learning efforts, then adjusting his goals to include working on 
what he had previously rejected. Related to vocabulary learning, Niles said 
that getting assistance, in general, doesn’t help him to retain a particular 
word. Rather, along with gaining assistance he said that he has to have an 
interest in the word in order to learn it. That interest leads him to do the 
studying necessary to internalize the word for future comprehension and use. 

In learning the language in Japan, Niles expressed a reluctance to use 
new grammatical structures gleaned from class or textbooks without first 
hearing them used in a social setting. He reported working to merge what 
he learned from textual sources—dictionaries and textbooks—with real-life 
examples from social interaction. He said that he made use of the social 
dimension as a stepping stone or scaffold to assist him in using the form 
himself. His strategy was to “see if you can hear it and then try to use it 
yourself, to extend it after you’ve heard it from someone” (Niles, 288-290). 

Niles said that he made use of both textual and social interactive 
sources of assistance. In selecting texts, he reported choosing those which 
were accessible to him. For example, early in his language learning history 
in Japan, he said that he purchased a Japanese-Japanese dictionary designed 
for elementary school students. Some of the primary methods of studying 
he related were reading the dictionary, looking up the words he didn’t know 
in a Japanese-English dictionary, and making extensive notes of words, 
definitions in English, and example sentences. This was a selective process 
of choosing words that interested him or that were noted as common words, 
and using other textual helps to work to understand this authentic text and 
learn the words. In terms of the ZPD, this was an interactive process 
involving selection of a developmentally appropriate authentic text (the 
elementary school dictionary) and using another textual source of assistance 
(the Japanese-English dictionary) for assistance, interacting with these via 
written notes which also served as a source of support. To connect this 
process to that described in the previous paragraph, Niles said that he also 
used social interaction, working to hear what he studied and use what he 
learned in a social interactive context. The written work provided a support 
for social interaction. Language development via social interaction, for 
Niles, appears to be interlaid with his experiences studying alone such that 
the latter enhanced the impact of social interaction. 


4 ; ; 
These numbers refer to line numbers of each transcript. 
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5.2 Mark: Japanese, Advanced Student 

Mark is an undergraduate student enrolled in a fourth-year Japanese 
course. He has over ten years of history studying Japanese, but with large 
gaps caused by sporadic college enrollment for reasons beyond his control. 
He describes a wide range of strategies to deal with the difficulties of the 
written language, as well as social strategies to involve himself in a 
Japanese-speaking community in the city he lives in. 

Mark has difficulty in tailoring his language learning efforts around 
materials that are within his ZPD. The main reason for this is that his 
studying is mostly related to his Japanese class, which is much too difficult 
for him. In his interview, he described all of the work he must do to access 
the texts assigned for his Japanese class, describing how he still could not 
really understand the readings even after an arduous process of dictionary 
use, text-copying, and re-reading. He said that he uses flash cards and lists 
to try to cope with the large number of vocabulary and characters he 
encounters, but that retention is a major problem. He also confessed to 
studying with a native speaker present who could simply tell him what each 
word meant, avoiding hours of dictionary work. Working with a person as 
a resource rather than a dictionary likely provides a more precisely tailored 
method of support, though Mark said that it didn’t help him to retain the 
vocabulary or characters he needed to learn, probably because the amount of 
material was overwhelming. 

To learn spoken Japanese, Mark discussed how he worked to find 
Japanese people willing to interact with him in Japanese. Mark said that the 
Japanese students he knows at his own university won’t speak Japanese with 
him. He attributes this to the fact that they have very strong English 
language skills. But, by visiting a local junior college with a program 
designed for foreign students, Mark made become friends with a group of 
Japanese students who primarily socialized with each other and were willing 
to interact with him in Japanese and help him with his Japanese homework. 


5.3 Evelyn: Chinese, Advanced Student; Japanese, Advanced Student 

Evelyn is completing a major in Japanese and is also taking some 
graduate coursework. While working on her major in Japanese, she has 
also been learning Chinese rapidly over a three year period by taking an 
intensive one-year course, continuing to take advanced Chinese coursework, 
doing an internship in Taiwan, and taking graduate-level university courses 
which require her to read academic articles in Chinese. She reports that her 
Chinese is better than or at least as good as her Japanese, which is 
remarkable considering that she has only studied Chinese for three years, 
whereas she lived in Japan for three years and has been studying Japanese 
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formally in the States for three years since then. In her interview, we 
focused on Chinese, her language of strongest facility and interest. 

Evelyn has a strong interest in written Chinese as well as in speaking 
and understanding the language. She said that she uses written Chinese to 
support her spoken language. For example, she reports enjoying the news 
in Chinese, using written captions on the screen to support the aural work 
involved. Unlike other TV shows, the news has this written component, 
allowing Evelyn to make use of the written text as a scaffold, helping to 
make the news accessible. This is an example of active management of the 
ZPD by selecting material which incorporates a textual form of assistance 
within it, assisting her to improve her comprehension of the spoken language. 
When she is exposed to a new word in social setting, she said that she 
follows that up with further studying of the written form of the word so that 
she can better learn the new vocabulary. Without interaction with the 
written language, she says she quickly forgets what she’s heard. With just 
interactive exposure, “they say okay and five minutes later [poof] ((hand 
gesture showing its gone))” (Evelyn, 260-264). Referring to getting 
assistance from her conversation partner, Evelyn reports “I don’t retain stuff 
that he tells me nearly as well as I retain stuff I’ve looked up.... Overall I 
somehow process or retain things better, or it’s just the interaction between 
the dictionary and myself” (Evelyn, 466-469). Evelyn used the word 
“interaction” to describe her work with the dictionary—interaction that is 
both visual and analytical, enabling her to retain new vocabulary better. 

Evelyn’s graduate courses in Chinese require her to read academic 
articles. She developed a reading method modeled after a recommendation 
of a German language professor she had years ago—to first look up all of the 
words and annotate the text, and then to read for content. This separates the 
help-seeking phase of her task (using dictionaries to look up unfamiliar 
vocabulary) from the work of reading for meaning. In the terms used by 
reading specialists, she begins with a bottom-up approach, moving to a more 
top-down approach in order to understand what the article is about 
(Rummelhart, 1977). Seeking support from dictionaries and annotating the 
text is often sufficient for Evelyn to learn new vocabulary. Evelyn reports 
“TJ have a pretty good memory, so I’m usually able to recognize characters 
pretty easily. ... I just make notes in the margins and I usually remember 
them when I see them again” (Evelyn, |. 99-107). Her strong visual 
memory eases her reading task, allowing further internalization of the new 
vocabulary. In terms of the ZPD, Evelyn’s use of the dictionary fills the 
gap between what she could not do without assistance (read and understand 
the article), and what she can do with that assistance (read and comprehend 
the article). 
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In using dictionaries, Evelyn says that she retains characters better if 
she uses bound dictionaries which require her to analyze the character’, 
rather than on-line dictionaries which ease the look-up process by finding 
highlighted text. Evelyn manages ZPDs in her language learning by 
organizing her studying so that the assistance she receives has maximum 
impact. By following up social interaction with dictionary use and 
prioritizing bound dictionaries as sources of lexical information, Evelyn 
maximizes her chances of retaining what she encounters. 


5.4 Victor: Chinese, Highly Proficient 

Victor is a professor who specializes in an area of China studies. He 
no longer considers himself to be studying Chinese, but still views himself as 
a learner because, he says, “I still encounter situations all the time where I’m 
learning something” (Victor, 14). He rarely consults native speakers for 
assistance, rather using dictionaries as needed. Unlike Evelyn who spoke 
of her use of dictionaries as an “interaction,” Victor said that when using 
dictionaries “I don’t have a sense that something is getting put together 
collaboratively and then I own it, I feel like I own it from the start” (Victor, 
614-7). This is an indication of Victor’s high level of accomplishment in 
Chinese—the language is something that he “owns.” He states: “I’m at 
the point now where I can acquire anything I need to on my own. For the 
short-term, retention is always an issue” (Victor, 332-334). And, while he 
confesses that retention is a problem, he also notes that he finds that he does 
not need some of the annotations he made when he previously used a text 
and refers to it again. When asked if he needs annotations again, he says 
“No, there are some I don’t need anymore,” continuing by adding “in fact I 
have gone back to articles that I read a long time ago . . . and I’ll be very 
amused at what I’ve annotated” (Victor, 199-205). In addition, he mentions 
that part of the retention problem he has is probably related to the size of his 
vocabulary and that the words he now needs to look up are not very frequent. 
Even so, because the materials he reads are high-level academic texts written 
for scholars, “there are huge amounts of new vocabulary words coming up 
all the time” (Victor, 170-172). Victor is expert at using various 
helps—dictionaries, whether on-line or bound, and Google—to get what he 
needs to handle most situations in Chinese. He said that he is comfortable 
with and enjoys this help-seeking process. Thought of from the perspective 
of the ZPD, Victor easily bridges the gap between what he cannot do without 
assistance and what he needs assistance with, and is able to do so 


5 ‘ . : : 
Looking up a character by its shape involves analyzing the character’s shape and 
determining its radical, the portion of the character used to classify it. 
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independently. 

Part of what Victor reported doing in his earlier days of learning 
Chinese was to create his own helps. For example, he created a glossary of 
field-specific terminology. In reading, he said that he only needs assistance 
of another person when he encounters classical Chinese in an academic text. 
In general, he noted that native speakers who are not expert in his field 
cannot help if he encounters difficult vocabulary; and, even if they could, 
Victor prefers to handle these problems independently. Victor does rely on 
native speakers when he writes papers or prepares presentations in Chinese. 
He continues, however, to rely on texts as well, borrowing structures from 
them. He states, “There are a couple of authors in Chinese whose writing 
style I like. I find it very clear and simple, so I will go to their papers” 
(Victor, 308-311). And, the texts he produces with native and textual 
assistance then become reference materials for his future writing needs: “I 
may . . . go back to that polished, edited version of that paper when I’m 
writing other things” (Victor, 289-290). While he does not view reading as 
a collaboration with textual helps, he does see writing that way: “I think 
that [writing] is really a process where I’m constantly assimilating these 
building blocks that are coming in” (Victor, 594-5). In interacting with 
textual models and a native speaker who gives feedback on his writing, 
Victor says “it’s like having two or three teachers who are telling you slightly 
different things and you know all three are correct in some sense but you’re 
trying to figure out which one you want to absorb” (Victor, 660-665). 

Victor is well able to handle the linguistic situations he encounters, 
whether in oral or written contexts, getting the assistance he needs to handle 
them. He clearly is an expert at managing ZPDs such that he learns new 
things in his encounters with these various forms of assistance. While he 
said that retention is a problem, he did report not needing some annotations 
when he goes back to a previously read article. He sees his preference for 
textual sources of help is part of his personality “I’m just more comfortable. 
I’m kind of shy and. . . ’m just much more comfortable puzzling something 
out on my own using whatever materials I have to help me” (Victor, 
510-510). He said that he may be missing something by not also seeking to 
learn via social interaction: “I’m not convinced that acquiring [new 
vocabulary] through reading instead of through conversation is better. In 
fact, I often think its not better, but I’m more comfortable in that mode and 
I’m sort of willing to sacrifice some efficacy for my own comfort level, if 
that makes sense” (Victor, 540-552). This is an interesting comment 
because Victor is highly proficient in both spoken and written Chinese, so his 
strategies have clearly been successful. Whether or not other strategies 
might have been more effective is unclear; what is clear, however, is that 
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Victor is well able to manage the difficulties he encounters by managing the 
sources of assistance he prefers to use. 


5.5 Sanjay: Sanskrit, Advanced Student 

While the other four interviewees are all speaker/readers of modern 
Asian languages, Sanjay, a graduate student, is a learner of Sanskrit, a 
classical Indic language. While he is able to learn something of Sanskrit 
through analogy with Hindi, his native tongue, in general his only way to 
access and learn Sanskrit is through written texts. He does not have the 
opportunity to develop proficiency via social interaction in Sanskrit. Such 
opportunities are available through special programs, usually offered in the 
summer, where “spoken Sanskrit” is taught. Sanjay has not yet had the 
experience of attending such a course, and his aural access to the language is 
only via hearing the Sanskrit read aloud, even if he is the one doing the 
reading. 

Even though Sanskrit is a reading language for Sanjay, he laments the 
lack of opportunity to approach the language conversationally while 
acknowledging his indebtedness to Hindi as giving him some access to the 
meter, phonology and structure of Sanskrit. Sanjay states: 


You know, in theory you can have conversations in Sanskrit but that’s not how I 
approach the language which is to my detriment, I think (Sanjay, 17-21). The 
interaction, the ability to speak the language to each other, or even hear other people 
speaking it or even reading it out loud, I think there’s an element that I find very 
helpful and when I move away from that I find I can regress. (Sanjay, 289-295). 


One of Sanjay’s goals is to become a fluent speaker of Sanskrit. 

However, Sanjay said that his main method of learning the language is 
textual. He describes a process of analyzing and understanding texts that 
involves interacting with a range of textual helps including dictionaries, 
translations, commentary, criticism, and professors. Sanjay describes this 
process: 


So you have one word that in different contexts has multiple meanings and 
unfortunately the dictionaries . . . don’t necessarily give it to you by the order of 
frequency and when the dictionary is 200 years old you know they didn’t know it back 
then .... And sometimes the process of looking up a word requires to you to think 
“can this be a possible meaning” or sometimes . . . you can actually come up with an 
interpretation that may be different from the commonly accepted interpretation 
(Sanjay, 187-202). Dictionaries and translations help you to a point but then you 
have to almost interact with the genre itself. ...You get into the dialogue and a 
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second order dialogue about interpretation and .. . carry it from there” (Sanjay, 
260-280). 


In addition to his dialogue with textual sources—the original text, 
dictionaries, commentary, etc., he also has interaction with professors and 
sometimes with people who use the text being studied as scripture. He said 
that he photocopies the text he is studying, annotating the photocopy and 
eventually reading by referring to the original book which is not annotated. 
This technique involves managing ZPDs—allowing for assistance via 
dictionaries and annotation, and then withdrawing the assistance by reading 
the unannotated original. In this way, his method of studying fosters 
learning and more independent reading. 

Sanjay views his interpretation of texts in Sanskrit very much as a 
collaboration between himself, the text, and his various sources of assistance. 
He is part of an ongoing dialogue between all of these sources, a 
conversation both ancient and modern. Through this process he constructs 
a meaning of the text being read and learns the language through this process 
of dialogic help-seeking and interpretation. 


6. Discussion 

Interviews with the five learners showed them to flexibly seek 
assistance from both interpersonal and textual sources based upon their own 
goals and the demands placed upon them. With the exception of Mark, who 
was enrolled in a Japanese course that he found to be much too difficult, all 
seemed quite capable at tailoring their help-seeking behavior to the nature of 
the problem encountered and the likelihood of finding assistance from a 
particular source. From the five interviews, it seems fairly clear that the 
ZPD, whether or not each person fully grasped the concept, is a part of the 
cognitive functioning of each of the five learners. Language proficiency 
seems to have less to do with this than the fact that all are cognitively mature 
adults. However, Niles, Evelyn, Sanjay and Victor, all of whom were using 
appropriately challenging materials, seemed better than the Mark at 
assessing the quality of assistance they might receive from particular sources, 
thus tailoring assistance appropriately to each situation. Because the 
reading tasks Mark was attempting were too difficult, he also had difficulty 
with the assistance he received. When interviewing him, I got the 
impression that rather than trying to learn everything, he was mainly just 
trying to get through the class. While the other four interviewees seemed 
masters of their language learning processes, Mark’s efforts seemed more a 
matter of academic survival. 

In terms of internalization, the interviewees reported that help received 
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from both person-to-text and person-to-person interfaces had potential to be 
internalized, with some learners expressing a preference for a particular 
modality either in general or depending upon the situation and what had 
already been effective or not the process of solving a particular linguistic 
problem. In addition, learners reported preferences for particular modalities 
based upon the effectiveness of a source of help in helping them to retain 
information or develop language proficiency. Differences in learning styles, 
however, do not seem to impact the learners’ expertise at creating ZPDs for 
themselves as they interact with L2 materials. Learners also reported using 
strategies to avoid seeking assistance when it wasn’t necessary, such as using 
context or character analysis to determine word meaning. Development in 
the ZPD involves incorporating assistance for challenging situations, with 
withdrawal of assistance to encourage internalization. Learners managed 
this for themselves, noting that they no longer needed to rely on annotations 
or vocabulary lists, for example, seeing this as evidence of learning. 

For an adult to assist a child’s cognitive growth in the ZPD, the adult 
must 1) assess the child’s skill with respect to the particular task, 2) tailor 
assistance to that level, 3) adjust the nature of assistance as needed, 4) 
withdraw assistance as the child’s skill grows. This sort of assistance is, to 
use Lantolf and Aljaafreh’s words, “graduated and contingent” (p. 468). 
Management of assistance in the ZPD has been considered to be something 
performed by the tutor. However, the more capable the one being assisted, 
the more it becomes possible or even likely that the one seeking assistance 
May Manage more and more of these functions. An autonomous adult 
learner may operate in a completely self-managed way regarding how 
assistance is sought and obtained. Other-management of assistance and 
self-management of assistance are described in Table 4. 


Table 4. Assistance in the ZPD: Other-management and Self-management 


Other-management Self-management 

Assess helper assess learner needs learner assesses own needs 

Tailor helper provides developmentally learner chooses appropriate modality 
appropriate assistance help (person, bound or _ on-line 


dictionary, Google, reference book, 
textbook, etc.) 


Adjust helper adjusts assistance in Learner adjusts source of help as 
response to learner needs needed to solve problem. 
Withdraw Adult provides less assistance as Learner seeks less assistance as skill 
learner skill grows grows, or avoids previously created 
helps 
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Tutoring may involve extensive other-management. However, adult 
learners may take on one or more of these management tasks, depending 
upon the nature of assistance sought. When the source of assistance is 
textual, the adult may manage all of these areas. Even when seeking 
assistance from another person, the adult is likely to manage some of these 
functions for him or herself, with the level of other-management dependent 
upon the nature of help requested, the learner’s willingness to share the 
management task, the developmental sensitivity of the helper, the helper’s 
level of commitment and interest, and the time available for help to be 
provided. 

The findings of these interviews are consistent with the idea that the 
ZPD may be internalized. The learners studied here show the ability to 
seek, tailor, and withdraw help, using a variety of sources depending on their 
personal preferences. With the exception of one learner who was forced to 
study materials that were too difficult for him, the learners studied have 
developed the ability to manage ZPDs to optimize their language 
development. These learners learn via interaction with various helps, 
combining consultation with native speakers with use of various textual 
sources. They ably manage both the sources of assistance and the type of 
assistance sought, making decisions about what they want to learn or focus 
on. Some arrange for withdrawal of assistance, for example for reading 
without annotations at a future time. Expertise at managing zones of 
proximal development is evident in the experiences of the adult language 
learners studied here. 

In conclusion, the highly literate adult has not only learned to read and 
write, but also to use written sources in addition to interpersonal ones to 
create zones of proximal development. Emerging initially in interaction 
between a child and the adults who interactively structure the child’s social 
world, over time the child becomes able to generalize the ZPD, learning 
through a growing variety of interactive settings, with adults and peers. 
While the ZPD is primarily managed by adults in interaction with small 
children, as children develop they are able more and more to manage their 
own ZPDs, working to seek assistance and to manage the level of assistance 
provided such that they can meet the challenges they encounter. This 
growing ability to manage the ZPD is, I propose, a mark of how the ZPD 
itself is slowly being internalized. As literacy is established, the ZPD is 
further internalized so that children become able to carve out developmental 
spaces, as they interact with written materials, initially with the assistance of 
another, working more and more independently over time. Internalization 
involves a gradual increase in the individual’s skill in managing their own 
ZPDs and creating developmentally rich settings for themselves. The 
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interviews conducted here show how adult learners are able to tap into the 
ZPD to manage their own learning. More research on learners of other ages 
and in other situations is needed to better see how this internalization 
develops. 

More research is also needed to determine whether the ZPD is an 
appropriate unit of language development across the lifespan. However, 
partial evidence is promising. While at first managed by another, the ZPD 
is finally managed by the learner him or herself. It begins as something that, 
in itself, is not part of the human organism, but must be internalized via the 
social interaction processes through which it is formed. In the end, the ZPD 
itself becomes a tool used by cognitively mature adults as they go through 
life seeking new skills and new knowledge, including knowledge of and skill 
in foreign languages. 
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Appendix. Interview questions 


1) 
2) 
3) 
4) 


5) 


6) 


7) 


8) 


9) 


What methods do you use to study [your language of specialization]. 
How have those changed as you became a more proficient speaker? 
How have those changed as you became a more proficient reader? 
What is the role of native speakers or others who are more proficient 
in your language of specialization than you are? Do you make use 
of their expertise? Ifso, how? How has that changed over time? 
How do you handle linguistic situations (speaking, listening, reading 
or writing) that are beyond your individual ability now? How is that 
similar to or different from how you handled such situations when 
you were less proficient? 

Are you familiar with the term “Zone of Proximal Development? 
(ZPD)” If not, would you like an explanation? 

How does the concept of the ZPD fit with your experiences learning 
and studying your language of specialization? How does it diverge? 
How do you feel your ability to work within your ZPD has changed 
as your language proficiency has improved? What are the “gaps” 
you encounter today when working with your language of 
specialization? How do they differ from those encountered earlier 
in your language learning career? 

Internalization in the ZPD is considered something that moves from 
the “between people” space to being an individual resource. Do you 
see similarities or differences with the process of learning language 
from textual sources? 
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Manual Introduction of Animate Referents 
in L2 Narrative Discourse 


Keiko YOSHIOKA 


0. Introduction 

In order to produce understandable and meaningful discourse, it is 
imperative for the speaker to clearly mark the identification of referents that 
are newly introduced into the story so that the listener can follow the 
development of the storyline in the rest of the story. The speaker may use 
various linguistic measures to achieve this. In the following sections, we 
explore the manner in which hand movements in conjunction with speech 
mark the introduction of new referents in the production of discourse. 


1. Introduction of referents in speech’ 

It has been suggested that the speaker indicates the degree of 
presupposition expected of the listeners for the interpretation of the identities 
of the intended referent by marking the informational status of the referents. 
Various notions and views on the information structure in discourse have 
been proposed (e.g., Ariel 1988; Chafe 1976, 1994; Clark & Haviland 1977; 
Givon 1983, 1984; Halliday & Hasan 1976). 

In this section, we will briefly review Givoén’s standpoint (Givén 1983, 
1984) since it has been widely adopted in studies of referential introduction 
and tracking in both L1 and L2. Framing the information structure using the 
notion of topic continuity, Givén (1983) argues that a topic continuing from 
the preceding clause will be more predictable and, as a result, may be easier 
for the listener to process than new or reintroduced topics. Thus, in discourse, 
linguistic devices used to mark new topics differ from those used for more 
predictable topics. Givon illustrates the relationship between the various 
indexical markings and the new and given informational status of a referent, 
as shown in Figure 1. According to this hierarchy of forms, zero anaphora 
shows the strongest relationship to continuous topics, while full NPs are 
most likely to be related to discontinuous topics. Givén (1984) explains this 
phenomenon through the quantity universal principle. According to this 
principle, “more continuous, predictable, nondisruptive topics will be marked 


1 4 : : ‘ 
In a narrative, the speaker usually introduces multiple referents, both animate and 
inanimate. However, we will limit our discussion to the introduction of animate entities. 
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by less marking material, while less continuous, unpredictable/surprising, or 
disruptive topics will be marked by more marking material” (Givén 1984: 
126, italics in the original). 


more continuous/accessible topics 


zero anaphora 
unstressed/bound pronouns 
stressed/independent pronouns 
full NPs 


more discontinuous/inaccessible topics 


Figure 1. Topic continuity and referential forms (based on Givén 1983) 


Giv6on’s quantity universal principle has been investigated using various 
languages, and there is a general consensus in the literature that it is correct 
(Clancy 1980; Givén 1983, 1984; Hinds 1983; Marslen-Wilson, Levy, & 
Tyler 1982). 

Figure 2 shows the manner in which differences in the informational 
status of referents may be encoded in the choice of referential forms in Dutch 
and Japanese. In Dutch, the form related to the most discontinuous topic is a 
full NP with the indefinite article een. Following this is an NP with the 
definite article het, then a pronoun and, finally, zero anaphora (@). The 
Japanese language basically contains two forms, namely, NP and zero 
anaphora. 


NEW<(discontinuous) < > GIVEN(continuous) 
een kikker het kikker hij a) 

a frog the frog he 

kaeru kaeru i) g 

frog frog 


Figure 2. Informational status and referential forms in Dutch and Japanese 


In many Indo-European languages, as in Dutch, articles are used to 
mark the newness of referents. However, such marking of newness using 
article systems is not a universal practice. These devices are not available in 
languages such as Japanese, Polish, Chinese, and Finnish. Accordingly, other 
means are adopted for marking the informational status of referents. For 
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instance, in Japanese, numeral classifiers such as h(p)iki (a counter for 
animals) may be used in place of an indefinite article, although this is not 
obligatory. In addition, the informational status of the referent may be 
reflected in the choice of the postpositional particles. Studies indicate that in 
languages such as Japanese and Korean, the use of the nominal particle is 
associated with the introduction of a referent (Hinds 1983; Nakahama 2003; 
Ono, Thomson, & Suzuki 2000). Thus, although the speakers may indicate 
the referents being introduced for the first time in a manner different from 
their subsequent mentions irrespective of the language they speak, the 
linguistic devices they use for marking the newness of the referents may 
reveal cross-linguistic variation. This variation appears to affect the marking 
of newness in L2; this is discussed in the following section. 


2. Introduction of animate referents in L2 

While there exist a considerable number of studies investigating 
reference management in L2, many focus on how learners track referents, 
and not on how they are introduced (e.g., Hendriks 2003; Jung 2004; Kumpf 
1992; Polio 1995; Williams 1988). With respect to reference introduction in 
L2, it has been suggested that the availability of newness markers in the L2 
speakers’ source language (henceforth SL) may influence the manner in 
which they mark the newness of referents in the target language (henceforth 
TL). For instance, Jarvis (2002) compared the manner in which Swedish and 
Finnish learners of English use articles to mark newly introduced referents. 
While Swedish has an article system, Finnish does not. The results reveal 
that Swedish learners are more successful than their Finnish counterparts in 
using English articles for marking newness. 

The interaction between the SL and the TL in newness marking is also 
suggested by Nakahama (2003), who compared Korean and English learners 
of Japanese with regard to the acquisition of postpositional ga as a newness 
marker. Nakahama found that although the acquisition of ga interacts with 
proficiency in both Korean and English learners as compared with the 
English learners, the Korean learners use ga as a newness marker in a 
manner more similar to that of the TL speakers. 

Furthermore, studies show that when the form used for marking the 
newness of referents in the SL is not available in the TL, the learners may 
seek other linguistic means to perform the same function (Jin 1994). In Jin’s 
study of English learners of Chinese, a language without articles, the learners 
overused demonstratives in combinations with NPs to denote reference 
maintenance. Jin interprets this as an influence of the SL. He argues that 
English learners may have relied on demonstratives in Chinese to perform 
the function of definite articles. 
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3. Gestures in narrative discourse 

Although gestures have long been viewed as a part of “nonverbal” 
activities, in this study, they are defined as hand and arm movements that 
co-occur with speech. With the exception of emblematic gestures such as the 
OK or thumbs-up signs, co-speech gestures (henceforth “gestures”) are 
mainly performed without conscious efforts on part of the speaker. One of 
the characteristics of gestures is that they are frequently semantically and 
temporally synchronous with the accompanying speech. Accordingly, those 
who adopt a psycholinguistic approach to gestures, as adopted in this study, 
claim that these gestures form an integral part of language use (McNeill 
1992). 

This relatively recent approach has resulted in various findings that 
reveal the intricate relationship between speech and gestures. According to 
the findings that are most relevant to the present study, some gesture 
performances may be related to the structure of information in discourse 
(Furuyama 2000; Gullberg 1998; 2003; Levy & Fowler 2000; Levy & 
McNeill 1992; Marslen-Wilson, et al. 1982; McNeill 1992). These findings 
suggest that newly introduced referents are more likely to be accompanied 
by gestures than preintroduced referents. In fact, McNeill (1992) adopts 
Givon’s scale of linguistic quantity to explain gesture performance. The 
basic assumption underlying McNeill’s view is the same as that adopted by 
Givon: when the information is predictable (given), there is less need for 
coding material. However, when the information is unpredictable (new), the 
speaker will provide more complex material through both speech and 
gestures. Various findings support this view (Gullberg 1998; 2003; Levy & 
Fowler 2000; Levy & McNeill 1992; Marslen-Wilson, et al. 1982; McNeill 
1992). An interesting question is whether this rule applies to L2 gesture 
production. 


4. Gestures in L2 

In the studies of L2 acquisition and use, gestures have been 
systematically framed in the taxonomies of communication strategies (CS) 
(Dornyei & Scott 1997; Faerch & Kasper 1983; Gullberg 1998; Kellerman, 
Bongaerts, & Poulisse 1987; Paribakht 1985; Tarone 1977). Conventionally, 
L2 gestures were viewed as a nonverbal strategy performed primarily to 
replace missing lexical items (Tarone 1977). More recent accounts describe 
gestures beyond this lexical substitute perspective (cf. “replacing” and 
“accompanying” gestures in Paribakht 1985), although miming continues to 
be the dominant focus. 

Consequently, a very limited number of studies have focused on the use 
of gestures by learners per se, particularly in relation to discourse. In one of 
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these studies, Gullberg (2003) examined the referential use of gestures in the 
narrative discourse of adult learners. Three informational types of referents 
were identified, namely, introduced, maintained, and reintroduced. Gullberg 
found that the performance of French learners of Swedish and Swedish 
learners of French were similar to that of natives with respect to the 
frequency of the gestural marking of introduced and reintroduced referents, 
while the gestural marking of maintained referents was considerably more 
frequent in L2 than in L1. Gullberg explains that the learners’ over-marking 
of maintained referents using gestures reflects their speech where they tend 
to over-mark these referents with NPs. The overuse of NPs in L2 discourse 
may most likely be due to the difficulties associated with using complex 
pronominal systems. Based on these results, Gullberg concludes that mastery 
of grammar may affect learners’ gesture production. 

Gullberg’s target languages (Swedish and French) are similar in that 
both languages have article systems that can be used by their speakers to 
mark the newness of referents. What remains unknown is how learners with 
typologically different SL and TL approach the TL conventions for 
referential introduction. In the present study, we focus on the introduction of 
referents by Dutch learners of Japanese. Dutch and Japanese show a 
cross-linguistic variation in the availability of linguistic measures to encode 
the informational status of referents. Thus, the SL-TL combination with 
these two languages is considered to be ideal for examining the following 
questions: 


1. Do Dutch learners of Japanese manually mark the introduction of 
animate referents in discourse? If so, does their manual reference 
introduction differ from their SL and TL norms? 

2. How can the findings be explained? 


5. Method 
5.1. Participants 

Twenty native speakers of Japanese and twenty Dutch learners of 
Japanese (who supplied both Dutch and Japanese data) participated in the 
study. However, since some speakers did not use gestures, usable data were 
provided by 15 L1 Japanese, 12 L1 Dutch, and 15 L2 Japanese narratives. 

Of the fifteen Dutch learners of Japanese, twelve were male and three 
were female. Their average age was 25.4 years. All the Dutch participants 
were multilingual. Apart from English, they had studied French and German 
for several years. It was considered crucial to match the nonnative 
participants in terms of the number of hours of instruction and the amount of 
time spent in the country where the target language was used. In order to 
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fulfill this requirement, all the nonnative narrators were recruited from the 
learners of Japanese who attended the same intensive course at a university 
in the Netherlands. None of the students had been provided with any 
instruction in Japanese prior to the course. All the nonnative participants 
received the same instruction from the same instructors for the same number 
of hours. In order to establish Japanese language proficiency, the L2 
narrators took Level 3 of the Japanese Language Proficiency Test (JLPT) 
administered by their language instructor one month prior to L2 data 
collection; the results of this test reveal that most of the learners possessed 
the relevant proficiency (roughly speaking, “intermediate”). 

Of the fifteen Japanese speakers, seven were male and eight were 
female. Their average age was 27.2 years. None of the Japanese participants 
were capable of communicating in other foreign languages. All the Dutch 
and Japanese speakers were right handed. All the subjects gave the 
researcher permission to use the filmed material for scientific purposes. 


5.2. Task and procedure 

The task employed for eliciting data was story retelling using a 
wordless picture book, Frog, where are you? (Mayer 1969). The book 
contains 26 separate frames and describes the story of a little boy and his dog 
who set out in search of the boy’s pet frog, which has escaped. They 
experience various adventures on the way and are finally reunited with the 
frog. 

The participants were filmed individually. They were provided with a 
copy of the picture book and were asked to memorize the story as thoroughly 
as possible so that they could retell it to a third person who did not know the 
story. No time constraint was placed on memorizing the story. When the 
participants decided that they were sufficiently prepared, they retold the 
story to a native listener. For the Dutch participants who provided the 
narratives in L1 and L2, the effect of the order of language used for the 
retelling task was minimized either by providing a long duration between the 
two data collections (ten months) or by counterbalancing the order. 


5.3. Coding of the gesture data 

The gestures that accompanied any newly introduced characters in 
speech were identified and coded. The different phases of gestures 
(preparation, pre-stroke hold, stroke, and post-stroke hold)* were noted for 
each gesture. The gestures that appeared to be produced in order to elicit 
assistance from the native listener were excluded from the analysis. The 


> For definitions of these gesture phases, refer to Kendon (2004) and McNeill (1992) 
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reliability of the coding was established by using a second coder who was 
trained by the author. The inter-rater agreement for the gesture data was 
92%. 


6. Results 
6.1. Introduction of animate referents in speech in L2 

Since gesture performance may be interrelated with speech production, 
we will first briefly analyze how learners mark the newness of referents in 
speech. Nearly half the L2 narrators give the boy either Dutch (e.g., Jan) or 
Japanese (e.g., Nakata) proper names. However, these names are not used for 
other characters. Unfamiliar animal terms are replaced by generic 
expressions. For instance, bees are often described as chiisai doobutsu 
(“small animals”) while the deer is frequently introduced as ookii doobutsu 
(“big animal”). Unlike in L1 Japanese, numeral classifiers or expressions 
with aru (“a certain/one”) are rarely used in L2 Japanese. 

Table 1 compares the forms used for referent introduction in L1 and L2 
narratives. 


Table 1. Forms used for referent introduction in L1 and L2 


NP-ga NP-o = _NP-ni/kara NP-copula NP-to NP-mo NP-wa NP-others’ 
(NOM) (ACC) (AGENTIVE) (with) (also) (TOP) 
Ll 68 10 13 12 Ly 4 1 2 
Japanese (53%) (8%)10 1 (8%) (10%) (10%) (13%) (3%)  (<1%) (2%) 
N=15 
L2 25 17 0 2 17 6 23 5 
Japanese (26%) (18%) (0%) (2%) (18%) (6%) (24%) (5%) 
N=15 


One of the striking differences between L1 and L2 is the use of the topic 
marker wa. Native speakers of Japanese use ga for 53% of the newly 
introduced referents; they rarely use wa for newness marking. On the other 
hand, L2 speakers use both ga (26%) and wa (24%) to mark the newness of 
referents. The frequent use of wa for referent introduction by L2 narrators 
suggests that the learners do not differentiate between wa and ga with respect 
to their association with the informational status of referents. For example, in 
(1), the newly introduced referent “big bird” is marked by the topic marker 
wa despite its new informational status. 


3 Others include ellipsis. 
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(1) ookina tori wa otooto Ce) bikkuri *shimashita (sasemashita) 
big bird TOP little brother ACC  surprise-PAST 
A big bird surprised my little brother 


The frequent use of wa indicates the difficulty in learning the pragmatic 
use of the postpositional particle ga. This is in accordance with the findings 
by Nakahama (2003), where the English learners of Japanese used ga 
considerably less frequently than the native speakers. Furthermore, the 
present results parallel the findings from the studies of child L1 learners of 
Japanese. Nakamura (1993) demonstrates that 3-year-olds are likely to 
introduce referents without a particle; further, the use of ga to mark new 
information only stabilizes after the narrators reach the age of 7 (1993: 
91-92). 

According to Table 1, L1 and L2 Japanese speakers also differ with 
respect to the frequency of the use of the agentive ni. Since this use of ni 
mostly appears in passive constructions, the fact that none of the L2 narrators 
use the particle may reflect their avoidance or ignorance of this complex 
construction. 

Furthermore, speakers of L1 and L2 Japanese differ in the extent to 
which they highlight the new referents pragmatically. For instance, fillers 
such as eeto (“ehm”) or nanka (“well”) and expressions such as mazu (“first, 
at first, in the first place”), saisho ni (“at first”), or toriaezu (“the thing is”) 
are uttered before the introduction of the primary and secondary referents in 
L1 Japanese as if to prepare the listener for the new information that follows; 
this has been demonstrated in (2). 


(2) eeto, nanka, mazu, hajimeni shonen ga ite, 
INJ _INJ in the first place first ofall boy NOM _ exist:TE 
De moo _ i-ppiki inu ga ita no 


then more one-CLS dog NOM exist:PAST SE 
Ehm, well, in the first place, the thing is, there was a boy, 
and then there was also a dog 


In addition, Japanese speakers often use the pragmatic particle to 
highlight newly introduced referents. The most frequently used particle is ne, 
which functions as a confirmation marker. This particle—which appears after 
the introduction of the primary and the secondary referents—prompts 
reactions from the listeners, who express their active involvement by 
backchannelling, nodding, or a combination of both. Refer to (3). 
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(3) nanka, otoko no ko ga ne 
INJ male GEN child NOM _ PP 
well, a boy, you see, 


“ ” 


un 
(NL) yeah 


bin no naka_oni kaeru o katteta no 
jar GEN inside DAT frog ACC keep-ASP:PAST SE 
kept a frog in a jar 


Furthermore, some Japanese speakers repeat the introduction of the 
same referents as though a single mention of an NP does not appear to be 
sufficient for establishing the identity of the newly introduced referent. The 
pragmatic use of language mentioned above is characteristically observed in 
the introduction of characters with high referential importance. In contrast, 
the Dutch speakers rarely used pragmatic measures to highlight the newly 
introduced referents either in their L1 or L2. 


6.2. Manual introduction of referents in L2 
6.2.1. Quantitative analysis of gestures accompanying introduced animate 
referents 

In order to analyze the acquired data, we compared the gestures 
accompanying the introduction of L2 with an L1 baseline (both L1 Dutch 
and L1 Japanese). We tallied every gesture accompanying the introduction of 
a new referent in speech. A total of 59 gestures occurred with the introduced 
referents in L2 production. Figure 3 presents a comparison of the results of 
L1 Dutch, Japanese and L2 Japanese. Sixty-two percent of the referent 
introductions in speech are accompanied by gestures in L2, while the figures 
are twenty-five percent for L1 Dutch and forty-six percent for L1 Japanese. 

A repeated measures analysis of variance revealed that there was a 
group effect in the frequency of the gestural marking of newly introduced 
animate referents (F(2, 39) = 6.78, p <.003). Post hoc analyses using the 
Dunnett T3 post hoc criterion for significance indicate that the average 
frequency of the gestural accompaniment of newly introduced animate 
referents is significantly higher in L2 Japanese than in L1 Dutch; however, 
no significant difference is observed in the frequency of gestural marking 
between L1 Japanese and L2 Japanese, as shown in Table 2. 
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MILID N=12 
MLid N=15 
WIL2J N=15 


Intro 


Figure 3. Frequency of the gestural accompaniment of newly introduced animate referents 


Table 2. Multiple comparison of the gestural accompaniment of newly introduced referents 


(Dunnett T3) 
mean difference std. error sig. 
L2 Japanese L1 Japanese 142 .100 .412 (n.s.) 
L1 Dutch 364* .096 .003 


Interestingly, when the gestural markings by the three groups of 
speakers are compared in detail, a different pattern emerges. We analyzed 
whether the frequency of the gestures would differ according to the 
referential importance of the characters (i.e., characters are labeled primary, 
secondary, and peripheral according to the importance of the roles that they 
play in the story). Figure 4 shows the results. 


MELID N=12 
MELIJN=15 
ML2J N=15 


primary & secondary peripheral 


Figure 4. Gestural accompaniment of newly introduced referents in L1 and L2 
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A repeated measures analysis of variance with one 3-level between- 
subjects factor (language group) and one 2-level within-subjects factor of 
character type (primary & secondary/peripheral) revealed a significant main 
effect of group (F(2, 39) = 6.47, p < .005) and an interaction between 
character type and group (F(2, 39) = 8.10, p < .005). In order to address the 
question of whether the L2 group differs from the L1 group statistically, we 
compared primary/secondary and peripheral characters using paired samples 
t-tests. The results are presented in Table 3. They reveal that the frequency of 
gestural marking of primary and secondary characters was significantly 
different between L1 Dutch and L2 Japanese, but not between L1 Japanese 
and L2 Japanese. On the other hand, the frequency of the gestural marking of 
new referents in L2 Japanese is significantly higher than that in both L1 
Dutch and L1 Japanese. 


Table 3. Comparison of the gestural marking of primary/secondary and peripheral referents 


primary & secondary peripheral 

df t sig. df t sig. 
L1 Japanese vs. L2 Japanese 14 .976 073 14 —2.668  .049* 
L1 Dutch vs. L2 Japanese 11 -6.504 —.000* 11 —2.217 — .018* 


The following points regarding the results appear noteworthy. First, the 
frequency of gestural marking of the newly introduced referents increases as 
the speakers switch from their L1 to L2. Second, of the two L1s, speakers of 
L1 Japanese produce gestures considerably more frequently than their Dutch 
counterparts to mark the introduction of primary and secondary referents. 
However, the difference in the frequency of gestures between the two L1 
groups diminishes with respect to the peripheral characters. Third, referential 
importance does not appear to have a significant effect on the L2 narrators’ 
gesture performance. More than half of the newly introduced referents are 
gesturally marked irrespective of their referential importance. In the 
following section, we will analyze some qualitative aspects of gestures 
accompanying the introduction of animate referents. 


6.2.2. Qualitative analysis of the manual introduction of referents in L2 

In Figure 5, the speaker manually marks the introduction of the boy and 
the frog in L2. It should be noted that these gestures are not produced to 
compensate for the lack of lexical items. The speaker appears to have a 
knowledge of the two words that are accompanied by the gestures, otoko no 
ko (boy) and kaeru (frog). Nevertheless, two gestures are performed to mark 
the introduction of referents. They are produced in a similar location in the 
gesture space (i.e., in front of the speaker) with a similar hand shape (Figures 
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5-1 and 5-2). 

With respect to speech, the speaker uses the noncanonical expression, 
“otoko no ko wa kaeru ga imasu” (lit: As for the boy, a frog exists). While 
this expression is frequently observed in the introduction of primary/secondary 
characters in L2, it is not observed in the L1 Japanese narrative. Another 
interesting point to be noted is that the dog is typically introduced after the 
introduction of the boy and the frog in L2, although this referent is frequently 
introduced with the boy before the frog in L1 Japanese. We will address these 
points later. 


1 L7: eto [otoko no ko wa] nn / eto 2 [kaeru_/ga_ /imasu] 
1 L7: INJ [male ACC child TOP] INJ / well 2 [frog / NOM “ exist: NONPAST] 


Well, the boy well has a, frog, frog (lit: As for the boy, a frog exists) 
Figure 5. Gestures accompanying the introduction of the boy and the frog 


Figure 6 shows that the same speaker introduces the boy and the frog at 
the beginning of his L1 narrative. The speaker states that the story is about 
the boy. The introduction of the referent is not marked by a gesture. In the 
succeeding clause, the frog is introduced as the object of the verb hebben 
(“to have”) and is marked by a two-handed gesture. In both L1 and L2, the 
speaker introduces the frog as the boy’s possession. 


Mijn verhaal gaat over een jongen 

Die heeft een [kikkertje in ]z’n vaasje 
My story is about a boy 

He has a little frog in his jar 


Figure 6. Gesture accompanying the introduction of the same referent in L1 
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In L1 Japanese, the boy, the dog, and the frog are frequently introduced 
as a collective subject of the existential verb, iru (exist). The accompanying 
gestures place the characters in different locations in the gesture space. 
Interestingly, native speakers of Japanese frequently contrast the boy and the 
dog visually in the gesture space.’ For instance, in Figure 7, the speaker first 
locates the boy in the right center of the gesture space with his right hand 
(Figure 7-1). Following this, the position of the right hand is sustained, while 
the dog is located in the left center of the gesture space with his left hand 
(Figure 7-2). Thus, the two main characters, the boy and the dog, are located 
in symmetrical positions. Only then is the frog marked manually in the centre 
of the gesture space, represented by a gesture depicting an entity, most likely 
the jar (Figure 7-3). 

In comparison with speakers in L1 Japanese, speakers in L2 Japanese 
are considerably less likely to visually contrast the boy and the dog. The 
qualitative difference in the gestures accompanying the introductions of the 
boy and the dog in L1 and L2 Japanese appears to reflect the difference in 
the manner in which the two characters are introduced in speech. As 
mentioned earlier, in L1 Japanese, the two characters (and sometimes the 
frog) form the collective subject of the existential expression. They are 
introduced as partners, and their relationship is reflected in the 
accompanying gestures. On the other hand, speakers in L2 frequently 
introduce the frog and the dog as the boy’s possessions using the 
noncanonical expression, as in “otokonoko wa kaeru/inu ga iru.” In this 
expression, the relative referential importance of the boy and the other 
animals is immediately clear: the boy plays a dominant role in the story. 


1 2 3 

1 [otokonoko to sorekara] 2 [inu to] 3 [ato bin ni_haittakaeru__kaeru ga __ite] 
1 [boy and then] 2 [dog and] 3 [also jar DAT enter-ASP:PAST frog exit:TE] 
There is a boy, and then a dog, and also a frog in a jar, and 


Figure 7. Gestures accompanying the introduction of animate referents 


* Gesture space is defined as the space in front of the speaker (McNeill 1992). 
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The lack of partnership between the boy and the dog is reflected in the 
gesture, where the speaker does not locate the referents in a symmetrical 
manner. For instance, the speaker referred to in Figure 5 introduces the dog 
as the boy’s possession in his speech. Although the referent is marked by a 
gesture, the gestural pattern indicates qualitative differences from the norm 
of the TL. A two-handed gesture depicts a round entity, and the boy and the 
dog are not contrasted in the gesture space. 

Interestingly, some characters in the story show more variation among 
the speakers than others with respect to the manner in which they are 
introduced manually. Individual variation is frequently observed with regard 
to the introduction of the dog. For instance, some speakers locate the referent 
with a single hand. On the other hand, others may use two-handed gestures. 
The use of a counting gesture was also observed. 

In contrast, there is relative agreement among the speakers with regard 
to the gestures accompanying the introduction of the frog. A majority of 
these gestures iconically represent an entity with one or both hands in front 
of the speakers. They typically iconically depict a round entity. 


Figure 8. The first three scenes of Frog, where are you? (Mayer 1969) 


These gestures may represent the frog as well as the jar, reflecting the 
picture where the frog is located in the jar. The differences in the degree of 
agreement between the gestures accompanying the dog and the frog may 
occur due to the ways in which the referents are introduced in the original 
picture book (see Figure 8). The location of the frog at the beginning of the 
story is important; this is because the entire story begins with the frog’s 
escape from the jar. In contrast, the location of the dog at the beginning of 
the story does not contribute significantly to the rest of the story. This 
difference in the importance of the location may be reflected in the variety 
and the similarity of the gestures accompanying the two referents among the 
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speakers. 

Some of the words used for the peripheral characters are unknown to 
the L2 speakers. In such cases, the learners commonly utilize general 
expressions such as ookii doobutsu (“big animal”) or chiisai mushi (“small 
insect”). Figure 9 shows that as the animal is introduced in the speech, the 
speaker moves his hands to depict the entity in the gesture space. 


L2: ato wa / n [ookii / dobutsu ga] *arimasu (imasu) 
L2: rest TOP [big / animal NOM] exist: NONPAST 


As for the rest, there is a big animal 


Figure 9. Gestural marking of the big animal 


Although the gesture stroke of the accompanying gesture often 
synchronizes with the adjective in the speech, a post-gesture hold usually 
occurs until doobutsu (“animal”) or tori (“bird”) is uttered. The hold suggests 
that the compound expression may be treated as a single entity, “big-animal,” 
rather than with an emphasis on the adjective ookii (“big”). 


7. Summary and discussion of bimodal animate reference introduction 
in L2 

One of the questions investigated in the present study was whether 
learners manually mark the introduction of animate referents. The 
quantitative analysis shows that this is indeed the case. More than 50% of the 
introductions of new animate referents in L2 are accompanied by gestures. 
Since gestures related to word-finding problems are excluded from the 
analysis, these gestures are likely to be related to the informational structure 
of discourse (McNeill 1992). While gestures are frequently associated with 
communication strategies in studies of L2 acquisition and use, the present 
study shows that L2 gestures do not merely reflect learners’ lexical problems. 

A comparison of the manual introductions of animate referents in L2 
with their L1 baseline reveals that gestures in L2 are not exactly identical to 
the norms required by the TL or the SL. Furthermore, the present study does 
not focus on the cross-linguistic analysis of gestures; however, it was 
incidentally observed that native speakers of Dutch and Japanese do not 
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manually mark the introduction of referents in the same manner. A strong 
tendency is observed in L1 Japanese to manually mark the introduction of 
those referents that have high referential importance. This finding differs 
from that obtained by Gullberg (2003), where native speakers of Swedish 
and French exhibit no difference in the manner of manual reference 
introduction. 

One possible reason for the discrepancy between the results may be the 
differences in the languages used in the two studies in terms of marking the 
informational status of referents. Both Swedish and French have article 
systems, which can be used by their speakers to clearly distinguish the new 
informational status of referents. Furthermore, these languages have 
pronouns that can encode number and gender. Thus, the gradual change in 
the informational status of referents can be expressed in various ways in 
these languages. On the other hand, the target language used in the present 
study lacks both article systems and the active use of pronouns. Without 
these linguistic devices, native speakers of Japanese may feel the need to 
firmly establish the identities of the referents at their introduction before 
switching the referential form from NP to zero anaphora. The frequent use of 
pragmatic measures may also be motivated by the same reason. 

Furthermore, gestures may prove effective for establishing the referents 
in the mind of the listener. They may heighten the attention of the listener, 
and the association between a referent and a particular gesture space can be 
used to create visual anaphoric linkage. It has been suggested that a gesture 
accompanying a new referent not only marks its introduction but also 
designates a particular locus in the gesture space for this referent (Gullberg 
1998, 2003; Levy & McNeill 1992; Marslen-Wilson, et al. 1982; McNeill & 
Levy 1993). Thus, when the same referent is mentioned at a later point in the 
story, the speaker may anaphorically point to the location associated with the 
referent, as if to re-activate it. The high frequency of gestural marking for 
characters with major referential importance in L1 Japanese may be caused 
by the fact that they have a more probable association with zero anaphora 
than peripheral characters. Japanese speakers may produce gestures to 
clearly establish the identity of referents that are likely to be associated with 
zero anaphora in the remainder of the story. On the other hand, equipped 
with various linguistic means to mark the informational status of referents, 
Dutch native speakers may not feel such a strong need for the manual 
introduction of important characters. 

With respect to L2 speakers, the motivation for the manual introduction 
of referents may be slightly different. The literature suggests the interaction 
between SL and TL in newness marking (Jarvis 2002; Nakahama 2003); it 
also suggests that if the form used for marking the newness of referents in 
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the SL is not available in the TL, the learners may seek other linguistic 
means to perform the same function (Jin 1994). The preference of the 
learners for using noncanonical expressions to introduce the boy and the frog 
in the present study suggests an interaction between the SL and the TL, 
where learners map their L1-based organization of meaning onto L2 within 
its morphological constraints. 

The L2 gesture pattern also appears to reflect the interaction between 
the learners’ knowledge of the two languages. We speculate that the absence 
of SL-like linguistic means to highlight new information and the lack of 
control of some of the linguistic devices may have motivated the Dutch 
learners in the present study to rely on the other mode of expression—the 
manual mode—to mark the newness of referents. For instance, learning the 
use of the particle ga for marking newness may be time consuming. The 
various pragmatic measures adopted by the Japanese native speakers—such 
as insertions before the introduction, extensive repetition of the information 
about the new referents, and the use of the discourse particle ne—may also 
be difficult to acquire. Since none of the pragmatic marking of referents is 
obligatory, such devices are unlikely to be taught in classrooms. Even when 
the L2 learners are exposed to such devices, they may fail to notice their 
important effects. Despite this, based on their knowledge of L1, learners may 
be aware that new information is, or should be, marked differently from 
predictable information. This may explain the high frequency of the manual 
introduction of animate referents in L2. Unlike in L1 Japanese, where the 
motivation to mark referents is related to the use of zero anaphora, learners’ 
gestures may largely be caused by the need to mark newness. Based on these 
results, we argue that the interaction between the SL and the TL may cross 
the modal boundary and be reflected in L2 gesture performance. 

Interestingly, some gestures in L2 appear not only to mark the newness 
of the referents but also to describe the content of speech. For instance, the 
gesture accompanying the introduction of a “big animal” or a “big bird” 
manually describes the size of the referent and simultaneously marks its 
newness. Thus, these gestures may have dual functions, although nothing 
definite can be stated at this juncture. 

Since the proficiency of learners was kept constant, the question 
regarding whether and how learners’ proficiency affects gesture production is 
beyond the scope of the present study. However, an interesting direction for 
future research is to examine the gestural performance of advanced learners. 

In conclusion, the results of the present exploratory study reveal that 
speech-associated gestures can be studied as a useful tool for the 
investigation of the manner in which native speakers and learners create 
meanings at the level of discourse. The examination of gestures may provide 
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a fruitful direction in advancing the study of second language acquisition and 
use. 
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Figure 8. From FROG, WHERE ARE YOU? by Mercer Mayer, copy right 
© Mercel Mayer, 1969. Used by permission of Dial Books for 
Young Readers, A Division of Penguin Young Readers Group, A 
member of Penguin Group (USA) Inc., 345 Hudson Street, New 
York, NY 10014. All rights reserved. 


Appendix a. ‘Transcription conventions for Japanese utterances (based on 
Iwasaki 2002) 
ACC accusative NONPAST non-past 
ASP aspect PASS passive suffix 
CLS classifier PAST past 
COND conditional PP pragmatic particle 
COP copula Q question marker 
DAT dative QT quotative 
GEN genitive SE sentence extender 
INJ interjection SSW sound-symbolic word 
MOD modal expression TE -te (conjunctive) form 
NEG negative TOP topic marker 
NOM nominative 
Appendix b. _ Transcription conventions for gestures 
[ ] segment where a gesture occurs 
[bold] illustrated gesture 
[underline] gesture hold 
/ pause (not measured) 
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Appendix c. Raw data for the introduction of referents in speech and 


gestures 
L1 Dutch L1 Japanese L2 Japanese 
speech 87 128 95 
gesture 22 60 59 
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The Use of Phrasal Verbs by Japanese 
Learners of English — Implications from 
Story Telling Data — 


Asako YOSHITOMI 


1. Introduction 

As an instructor of English as a foreign language, I often encounter 
expressions used by students that are grammatically correct yet sound 
awkward or unnatural. Being a learner of English myself, I receive 
feedback from native-speaker editors who say, “Well, I guess you could say 
it that way, but I would probably say it this way.” The reason for this kind 
of reaction to second language (L2) use could stem from a range of 
possibilities such as an inappropriate selection of style or register (i.e. a 
matter of sociolinguistic competence), an uncommon combination of words 
(i.e. a matter of knowledge of collocation), an inaccurate choice of word (i.e. 
a matter of having comprehensive vocabulary knowledge including the 
nuance or connotation of vocabulary items), among many others. If it 
comes to face-to-face oral language use, phonological elements and 
non-verbal factors such as the use of eye-contact, gesture, and a host of 
paralinguistic elements could also contribute to giving the impression of 
unnaturalness. 

With the spread of communicative language teaching and the 
application of an increasingly multi-faceted construct of “communicative 
competence” in language testing, there is a need to discuss not only the 
fluency, accuracy, and appropriateness of language use, but also what is 
intuitively captured as the “naturalness” or “native-likeness” of language use 
and how it is acquired in second language acquisition (SLA). This involves 
an attempt to decipher what linguistic and non-linguistic features constitute 
naturalness/native-likeness and an investigation of how these features 
develop in second language learners. 

The field of SLA has recently witnessed an increasing number of 
studies involving lexical acquisition, and the use and development of 
knowledge in formulaic sequences in particular. Formulaic sequences have 
been proven to prevail in native speaker speech and are believed to be one of 
the major elements that contributes to a native-like impression of language 
use. One type of formulaic sequence is phrasal verbs. My impression is 
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that learners of L2 English seem to have difficulty using phrasal verbs 
effectively, and instead tend to use bookish multi-syllable one-words to 
express the same idea (e.g. “investigate” for “look into”). Having the 
opportunity to teach both Japanese learners who have received their foreign 
language education entirely in Japan and those who have spent some time 
abroad in an English-speaking country with a chance to acquire English 
formally and naturally, I feel a difference in the two types of learners’ 
English not only in terms of their fluency and phonological skills but also in 
terms of their use of formulaic sequences such as phrasal verbs. The 
difference seems especially conspicuous in Basic Interpersonal Communication 
Skills (BICS)-type, everyday conversational L2 use. For obvious reasons, 
such as exposure to natural everyday L2 input, Japanese ESL learners seem 
to sound more casual and natural when they speak English than EFL learners 
who are assessed as being at approximately the same “level” as the ESL 
learners but nevertheless tend to sound more formal or bookish and less 
natural. Japanese returnees, in particular, who have spent two or more 
years in an English-speaking environment before puberty have near-native 
accents of English and are often rated higher in interviews than 
non-returnees with similar grammatical knowledge. This is most likely due 
to the returnees’ ability to use discourse fillers, communication strategies, 
and native-like non-verbal behaviors in addition to their flawless accents, but 
also because of their larger repertoire of practical formulaic expressions, 
including the use of phrasal verbs, that they have acquired in natural 
language learning contexts. In essence, the returnees or ESL learners tend 
to sound more natural than the advanced level EFL learners because of these 
features. 

This paper focuses on the use of phrasal verbs in L2 speech by 
advanced adult Japanese learners of English with and without experience of 
residing in an L2 environment. I intend to explore the concept of 
“naturalness/native-likeness” by examining the L2 narrative performances of 
EFL learners and ESL learners/returnees. The issue in focus is how 
formulaic sequences, especially phrasal verbs, are used by advanced learners 
in spontaneous speech, and whether there really are differences due to the 
learners’ language learning experience. 


2. Background 

In this section, a definition of formulaic sequences is presented 
followed by a discussion of how formulaic sequences relate to language use 
and language development. Then a number of previous studies that 
examined the use of phrasal verbs by language learners are reviewed. 
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2.1. Formulaic sequences 
2.1.1. A definition of formulaic sequences 

Here the term formulaic sequences subsumes a wide range of 
phraseology used to refer to a multitude of related concepts, including 
chunks, collocations, fixed expressions, formulas, idioms, lexical phrases, 
prefabricated patterns, ready-made utterances, and routines, etc. Despite 
the widespread recognition of formulaic sequences in language use, its 
definition has remained difficult due to its diversity. Formulaic sequences 
vary in length, and have numerous linguistic and social interactional 
functions (Wray 2000). They can be completely fixed or have a number of 
open slots which can be filled with appropriate words or strings of words. 
Moreover, they differ in the degree of transparency or opacity of meaning. 
In the present study I adopt Wray’s definition of formulaic sequences: 

A sequence, continuous or discontinuous, of words or other elements, which is 
or appears to be, prefabricated: that is, stored and retrieved whole from memory 
at the time of use, rather than being subject to generation of analysis by the 
language grammar (2002; 9). 

Schmitt and Carter (2004) point out that although it is difficult to 
identify absolute criteria which define formulaic sequences, it is possible to 
list some of their typical characteristics. Formulaic sequences are in most 
cases stored and processed as unitary wholes (Schmitt & Carter 2004), they 
are spoken more fluently with a coherent intonation contour (Peters 1983), 
and processed more efficiently as single memorized units than the same 
sequence of words generated creatively both in language production (Pawley 
& Syder 1983, Kuiper 2004) and in language comprehension (Schmitt & 
Galpin 2004). Formulaic sequences may have slots that enable flexibility 
of use but the slots typically have semantic constraints. Furthermore, 
formulaic sequences may have semantic prosody (Sinclair 2004) and are 
often tied to particular language functions (Nattinger & DeCarrico 1992; 
Schmitt & Carter 2004). 


2.1.2. Formulaic sequences, language use, and language development 

It has been established in native speaker language use that formulaic 
sequences occupy a significant portion of daily language use (Nattinger & 
DeCarrico 1992, Schmitt & Carter 2004). Altenberg (1998) estimates that 
as much as 80% of the language used by an adult native speaker may be 
formulaic. According to Erman and Warren (2000), more than half of the 
spoken and written English discourse they analyzed consisted of various 
types of formulaic sequences. Likewise, Foster (2001) reports that 32.3% 
of unplanned native speech they examined was made up of formulaic 
sequences. Numerous scholars suggest that a proficient English speaker’s 
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knowledge of formulaic sequences is quite extensive and may even be of 
equal or greater significance than the lexicon of single words (Pawley & 
Syder 1983, Jackendoff 1995, Mel’cuk 1995). 

Compared to the increasing amount of study conducted on the use of 
formulaic sequences by native speakers of English, research on how they are 
acquired either in L1 or L2 is fairly modest (Weinert 1995). The 
acquisition of a formulaic sequence, like any single lexical item, is most 
likely incremental (Schmitt & Carter 2004). Although some short and 
salient formulaic sequences may be learned holistically, others are initially 
mapped phonologically as wholes but with some elements still incompletely 
grasped (Wray 2002). As acquisition progresses, the gaps are gradually 
filled in. The component words and syntactic structure may be analyzed 
and segmented later. It is also assumed that learning the appropriate 
constraints for the items that could be inserted in the slots will take a longer 
time than learning the initial frame of the formulaic sequence. Some L1 
learners seem to learn and use formulaic sequences before they have 
analyzed their constituent segments (Schmitt & Carter 2004) although it may 
depend on whether the child is more of a system learner or an item learner 
(Cruttenden 1981, Peter 1983) and on the degree of transparency of the 
formulaic sequence. The consensus is that the use and acquisition of 
formulaic sequences are motivated by the need and desire for social 
interaction (Schmitt & Carter 2004). Although formulaic sequences are 
observed among the first output in learner speech, some are not appropriately 
processed until the late teens even in L1 (Wray 1999). 

Wong-Fillmore (1976) and Bardovi-Harlig (2002) attribute a significant 
role to formulaic sequences in L2 acquisition as well. They both claim that 
formulaic sequences eventually get analyzed and contribute to the 
construction of creative rules. Thus, the acquisition of formulaic sequences 
may be essential in constructing L2 grammar. However, an opposing 
position is held by Krashen and Scarcella (1976) who insist that memorized 
utterances and creative speech are neurologically different and that there can 
be no interface between them. The well-known case study by Schmidt 
(1983), in fact, reports that his Japanese subject, Wes, used a wide range of 
memorized chunks, but those chunks did not evolve towards creative rules. 
Rather, they seemed to function as communication strategies or production 
strategies and contributed to increased fluency. 

Huang (1970, quoted in Ellis 1994) and Hakuta (1974) found that 
language learners used formulaic utterances as communication and learning 
strategies. When the use of formulaic sequences is automatized, working 
memory gets freed up (Ellis 1996), allowing for remaining processing 
resources to be allocated elsewhere, such as for generating propositional 
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speech. Since formulaic sequences are often linked to particular language 
functions in certain social situations, their knowledge is linked to pragmatic 
competence (Wood 2002) as well. Other scholars agree that native speakers 
do not seem to make full use of all the possible rule-governed sequences that 
can be generated, but rather, use particular forms for particular functions in 
specific contexts (Weinert 1995, Pawley & Syder 1983, Nattinger & 
DeCarrico 1992). In other words, the forms and usage of language may be 
quite closely linked. Therefore, even if formulaic sequences do not directly 
contribute to constructing L2 grammar, it could still be effective in 
sustaining social interaction via more efficient communicative ability, and 
thus, be beneficial to language learning indirectly. 

However, to what extent language use or acquisition relies on the idiom 
principle, that is, memorized sequence-based learning and holistic language 
processing, or the open-choice principle, that is, grammar generated, 
rule-based learning and analytic language processing (Sinclair 1991) is 
unknown. What is acknowledged, however, is that formulaic sequences are 
elements that are learned both first and last in L2 acquisition (Pawley & 
Syder 1983). They are one of the most difficult stumbling blocks for 
advanced non-native speakers, especially for those who have not started L2 
learning in early childhood (Wray 1999). Even highly proficient non-natives 
are unable to separate out grammatical but non-idiomatic sequences (Pawley 
& Syder 1983). Several studies, in fact, indicate that advanced learners 
tend to avoid using idioms and phrasal verbs in their speech (Laufer 1990, 
2000; Liao & Fukuya 2004). 


2.2. Phrasal verbs 
2.2.1. A definition of phrasal verbs 

If the effective use of formulaic sequences contributes to increased 
fluency and/or L2 development, examining the use of one type of formulaic 
sequence, phrasal verbs, is likely to provide insight into how knowledge of 
formulaic sequences develops in learner English. 

Phrasal verbs are sub-types of formulaic sequences. Moon (1997) uses 
the term “multi-word item” to refer to formulas that are institutionalized, 
fixed, and non-compositional, and lists compounds, phrasal verbs, idioms, 
fixed phrases and prefabs as subcategories. Phrasal verbs are defined as 
structures that consist of a main verb and a particle that is an adverb or a 
preposition (Quirk, Greenbaum, Leech & Svartvik 1985). Different phrasal 
verbs differ in terms of their semantic transparency. Laufer and Eliasson 
(1993) distinguish semantically transparent phrasal verbs (e.g. come out), 
semantically semi-transparent phrasal verbs (due to a metaphorical shift of 
meaning but whose meaning could be derived in context) (e.g. let down), and 
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semantically opaque or figurative/idiomatic phrasal verbs (e.g. put off). 
They found that, in multiple choice and translation tests, advanced Swedish 
learners tended to avoid idiomatic phrasal verbs compared with phrasal verbs 
that were semantically more transparent. Dagut and Laufer (1985) used a 
similar classification, distinguishing three types of phrasal verbs, and found 
that advanced Hebrew speakers of English tended to avoid using phrasal 
verbs in general compared to one-word verbs in a multiple-choice test, a 
verbal translation test, and a verb-memorizing test, and that they especially 
avoided figurative phrasal verbs. Hulstijn and Marchena (1989) conducted 
a follow-up study using three tests of similar kinds and found that Dutch 
learners of English, having similar phrasal verb constructions in their L1, did 
not avoid phrasal verbs in general. However, they did avoid idiomatic 
phrasal verbs that were perceived as very Dutch-like. Thus, decisions of use 
or non-use of phrasal verbs seem not only affected by the degree of L1-L2 
similarity but also by the learners’ perception of transferability based on 
psychotypology (Kellerman 1983). Liao and Fukuya (2004) also examined 
advanced Chinese learners of English through three tests of similar types, 
and found that while learners tended to avoid phrasal verbs generally, they 
did so more for figurative ones than literal ones in a translation test. 

In sum, previous literature indicates that the use of phrasal verbs by 
advanced learners of English is influenced by L1-L2 similarity/difference, 
the learner’s perception of transferability, and the degree of L2 complexity, 
with semantically opaque phrasal verbs avoided more than semantically 
transparent phrasal verbs. These studies mainly used written data. To my 
knowledge, research on the use of phrasal verbs in L2 learner’s spontaneous 
speech data produced by Japanese adult speakers is still quite limited. 
Furthermore, a comparison between advanced learners who have experience 
living in an L2 environment and those who don’t is of interest since it may 
indicate different language use or learning processes due to different 
language learning contexts. Since formulaic sequences, including phrasal 
verbs, are prevalent in natural native speaker daily interaction, it is predicted 
that ESL learners will have an advantage over EFL learners in this respect. 


2.2.2. Phrasal verbs in English vs. Japanese 

According to Slobin (2005), English phrasal verbs and Japanese phrasal 
verbs differ especially in the way they express motion with manner. 
Manner-of-motion verbs typically involve the encoding of the motion itself, 
the manner of the motion, and the path of the motion. For example, a 
phrase such as “tiptoe out” consists of motion (“to go/move”), manner (“on 
tiptoe”), and path (“from inside to outside”). English has a tendency to 
express motion and manner together as a single lexical item (as in “tiptoe”), 
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while Japanese has a tendency to express motion and manner as separate 
lexical items, with path combining with motion, and manner expressed as an 
additional noun phrase in many cases. Hence, the most straightforward 
translation in Japanese of “tiptoed out” would be “shinobiashi-de dete-iku,” 
literally translated as “on-tiptoe out-go” or “go out on tiptoe.” Slobin 
suggests that the difference in the ways phrasal verbs _ including 
manner-of-motion verbs are realized in English and Japanese motivate 
Japanese learners of English to use phrasal verbs in forms such as “go out on 
tiptoe” or “come in flying” more often than “tiptoe out” or “fly in,” for 
instance. 


3. Research agenda 
3.1. Some specifications of the present investigation 

The present study attempts to examine the use of phrasal verbs in 
narrative speech data by Japanese EFL learners and ESL learners/returnees. 
Narratives are beneficial in examining the learners’ BICS level language 
performance. Here, instead of letting learners tell their own stories about 
themselves, I have selected to give them a story they have to tell, so that the 
content of the speech is not only controlled but also so that the learners are 
forced to describe particular scenes in a given storyline. The selected story 
used in this study consists of many scenes where motion needs to be 
expressed. Also, narrative style in general is predicted to encourage the use 
of formulaic sequences. 


3.2. Research questions and hypotheses 
The general research question I ultimately aspire to answer is: How are 
formulaic sequences acquired and used in English learners’ spontaneous 
speech performance? In the present study, I limit myself to examining the 
use of phrasal verbs by advanced Japanese learners of English in their 
performance in a story telling task, and address the following four more 
specific research questions: 1) Do advanced Japanese learners of English use 
phrasal verbs in near-native like frequency? 2) Is the use of phrasal verbs 
affected by the L2 complexity of phrasal verbs? 3) Is the use of English 
phrasal verbs, especially those that describe motion, affected by Japanese? 
4) Do advanced Japanese learners of English who have experience acquiring 
L2 naturally in an English-speaking environment (here, referred to as ESL 
learners) use phrasal verbs more often or in a more native-like way than 
those who do not have such experience (here referred to as EFL learners)? 
The corresponding hypotheses I attempt to test are as follows: 
Hypothesis I. Advanced learners use more phrasal verbs than less advanced 
learners but the relative frequency of phrasal verbs is lower compared to that of 
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native speakers. 

Hypothesis II. Learners tend to use semantically transparent phrasal verbs 
more than semantically opaque phrasal verbs. 

Hypothesis III. The structure of the phrasal verbs that are used by the learners 
are affected by the differences that exist between the way English and Japanese 
realize phrasal verbs depicting motion with manner. 

Hypothesis IV. Although advanced ESL learners and EFL learners do not differ 
significantly in the number of word types and tokens they use in narrative 
speech, ESL learners use more phrasal verbs and more English-like idiomatic 
ones than EFL learners. 


4. Methodology 

Subjects. Subjects were 20 university students in Japan. All students 
were either English language majors or minors. Their TOEIC scores 
ranged from the 600s to the 900s’. Among the 20 students, 5 students had 
the experience of living in an English-speaking country for three or more 
years (these subjects are called ESL learners in this study), while the other 15 
had no experience living in an English-speaking country (these subjects are 
called EFL learners). All subjects received EFL education in Japan from 7" 
grade upon entering junior high school. Data from two native American 
English speakers were collected for baseline comparison. 

Data. Subjects were asked to complete a survey that asked about their 
language background. Subjects individually met with the researcher and 
were given a picture book, titled Frog, where are you? This is a picture 
book that is commonly used in eliciting speech data in L1 and L2 acquisition 
studies. It has no language in it, yet has a clear storyline that is depicted by 
a series of pictures. The story is of a boy and a dog going out in the woods 
to search for a pet frog. Many scenes involve the encounter of the boy and 
dog with various types of animals in the woods and a lot of motion by the 
boy and the animals. The subjects were asked to look through all the 
pictures first, then go back to the first page, and tell the story in English 
whenever they were ready. Although free planning time was given, only 
one learner spent a considerable time preparing her story. They were not 
allowed to take any notes. After the subjects had finished telling the story, 
they were asked to listen to the recording of their own narrative, and 
comment in Japanese on any difficulties or uncertainties they experienced 
while telling the story and report any communication strategies that they 


Some of the subjects had never taken the TOEIC test. However, since those subjects did 
have different levels of qualification based on the STEP test, their STEP level was 
converted to corresponding TOEIC scores, using the conversion scale offered by Oxford 
University Press. 
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employed to cope with such difficulties or uncertainties. 

Data analysis. Both the English story telling data and the Japanese 
retrospection data were transcribed using the CHILDES and CLAN language 
analysis software. The frequency of word types and tokens that were used 
were tallied and type/toke ratios computed. The frequency of phrasal verb 
types and tokens that were used as well as the percentage of phrasal verb 
tokens that were used correctly were also calculated, and compared among 
EFL learners, ESL learners, and native speakers for a quantitative analysis. 
Furthermore, an examination of the characteristics of phrasal verbs that were 
used was conducted. Finally, comments gathered through retrospection 
were utilized as qualitative data to supplement the quantitative analysis. 


5. Results 
5.1. The frequency of phrasal verb use 

Table 1 lists the results of the tally. The average number of word types, 
tokens, and type/token ratio, as well as the average number of phrasal verb 
types, tokens, and percentage of phrasal verbs that were used correctly are 
shown respectively for the Japanese EFL learners, ESL learners, and English 
native speakers. 


Table 1. Average number of word types, tokens, and type/token ratio, phrasal verb (PV) 
types and tokens, and average percentage of correct phrasal verb use by EFL 
learners, ESL learners, and native speakers (NS) 


EFL learners ESL learners Native Speakers 
Word types (frequency) 107.40 110.71 156.0 
Word tokens (frequency) 323.40 359.93 429.0 
Type/token ratio 0.32 0.35 0.36 
PV types (frequency) 10.2 10.0 29.0 
PV tokens (frequency) 14.2 13.2 36.0 
Correct PV use (%) 88.3 95.5 100.0 


Table 1 indicates that although the percentage of error-free phrasal 
verbs is slightly higher for ESL learners than EFL learners, the two groups 
are very similar in terms of the number of word types and tokens as well as 
the number of phrasal verb types and tokens they used in the story telling 
task. On the other hand, advanced Japanese learners of English, regardless 
of whether they have experience of learning English in an L2 environment or 
not, use significantly less phrasal verbs compared to native speakers in 
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spontaneous narrative speech. A Chi-square analysis of the frequencies of 
word types, word tokens, phrasal verb types, and phrasal verb tokens among 
the three subject groups revealed that the difference is statistically significant 
(x °=16.01 df=6 p< .015), but that the significance is due to the difference 
between native speakers and learners, and not to the difference between the 
two types of learners. 

As there was no significant difference between the ESL learners and the 
EFL learners, a comparison was also made between the lower half of all the 
ESL and EFL learners combined (i.e. ten subjects who had TOEIC scores 
ranging from 600 to 800) and the upper half of all the learners combined (i.e. 
ten subjects who had TOEIC scores of 840 to 960) to see if there were any 
differences due to L2 proficiency level. The percentage of correct use of 
phrasal verbs was higher for the higher proficiency group than the lower 
proficiency group, but the numbers of word types, tokens, or phrasal verb 
types, and tokens were very similar between the two learner groups (See 
Table 2 below). Chi-square analysis revealed that the difference among the 
subject groups is significant (x °=17.365 df=6 p< .01), but again it is the 
native speakers’ performance that mainly contributes to the difference. 


Table 2. Comparison of the average number of word types, tokens, type/token ratio, 
phrasal verb (PV) types and tokens, and average percentage of correct phrasal 
verb use by lower learner group, higher learner group (both ESL and EFL learners 
combined), and native speakers (NS) 


Lower Group Higher Group Native Speakers 
Word types (frequency) 107.6 112.4 156.0 
Word tokens (frequency) 347.5 353.4 429.0 
Type/token ratio 0.32 0.33 0.36 
PV types (frequency) 9.4 10.9 29.0 
PV tokens (frequency) 13.2 14.7 36.0 
Correct PV use (%) 87.1 92.5 100.0 


5.2. The types of phrasal verbs used 

Table 3 lists the proportion of different types of phrasal verbs used in 
the story telling task by EFL learners, ESL learners, and native English 
speakers, together with some representative sample expressions that were 
employed. 


Presented by: https://jafrilibrary.com 


Presented by: https://jafrilibrary.com 


The Use of Phrasal Verbs 211 


Table 3. Average percentage of phrasal verbs (PV) used by EFL learner, ESL learners, and 
native speakers (NS) according to semantic transparency 


EFL learners ESL learners Native Speakers 
% Semantically 77.0 68.2 86.0 
transparent PV 
Sample PVs fall in, go out, fly out, sneak out, climb up, creep out, 
run away throw out throw down 
% Semi-transparent PV 20.7 30.3 11.1 
Sample PVs come up, find out, _find out, get up, call out, piss off, 
wake up go over scare away 
% Non-transparent PV 2.3 1.5 2.8 
Sample PVs keep on, giveup  giveup,showup fool around, turn out 


We can see from Table 3 that the majority of phrasal verbs used by all 
speakers were semantically transparent ones. This is most likely due to the 
nature of the storyline depicted in the picture book. The scenes in the 
picture book were predominantly those that described concrete physical 
motions of the boy and various animals. Contrary to what was predicted, a 
comparison of the proportion of different types of phrasal verbs used by 
learners and native speakers did not indicate that learners use less 
semantically opaque/non-transparent phrasal verbs compared to native 
speakers. However, it was found that between the ESL learners and the 
EFL learners, ESL learners, like native speakers, use phrasal verbs with main 
verbs that have more specific meaning than EFL learners, who, in 
comparison, tend to use phrasal verbs with verbs that have more general 
meaning, such as come, go, and look. Specific verbs tended to be lexical 
items that realized motion with manner, i.e. more English-like, such as creep 
out and sneak out. Thus, although the ESL learners and EFL learners did 
not differ quantitatively in their use of English phrasal verbs, they differed 
discursively in that ESL learners used more specific verbs which made their 
manner-of-motion phrasal verbs sound more English-like. Moreover, ESL 
learners tended to use more semantically semi-transparent phrasal verbs 
compared to EFL learners. However, due to the limited number of native 
speaker data used in the present study, a reliable comparison between the 
learners’ performance and the native speakers’ performance cannot be made. 


5.3. Observations based on retrospective data 
An examination of the retrospective data revealed that the learners were 
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generally aware of their lack of grammatical and lexical knowledge. They 
reported using various communication strategies, a majority of them, in 
compensation for the lack of particular noun or verb expression. Examples 
of expressions actually used (for intended nouns) include various forms of 
paraphrase such as approximation, circumlocution, and word coinage. E.g. 
bird (owl), a small creature (mole/skunk), squirrel (raccoon), big animal, 
horse with two horns (reindeer), house/cage/box/glass (jar), stick of wood 
(branch), hill (cliff), tree (log), bee’s house/honeynest* (beehive), crack 
(hole), big stone (rock), voice/ someone speaking/ heard something (croak). 
Examples of expressions actually used (for intended verb expressions) are 
get/be angry at (irritated/upset/annoyed/feel sorry), is important (precious), 
frog where are you (call out for the frog), put into (drop/throw into) the pond, 
saw (came across), run away/escape/there is no frog (vanished/disappeared), 
hinded* (hide behind), bring up* the boy (carry the boy on its head), was 
followed by/run after/got attacked (got chased by), the deer striked* them 
and so they fell down (the deer threw them over the cliff), and told the dog to 
be quiet (hushed/shushed the dog). 

As can be seen from the examples above, some of the learners’ attempts 
to express certain ideas resulted in non-words or erroneous use of words. 
In many cases, learners commented that they simply gave up searching for 
the right word and resorted to avoidance of certain expressions or sometimes 
even of entire scenes. For example, different learners reported avoiding 
verb expressions such as approached/came towards, crept up, and dashed 
across the field, because they “just didn’t know how to express the specific 
motion in English.” The majority of them generally had trouble finding 
specific vocabulary items to describe a certain situation in detail. Some of 
these involved searching for but failing to use particular phrasal verbs. 

The learners also commented on the grammatical correctness of their 
speech. Some reported doubt about how to use verb tense or aspect 
effectively in describing a story. For example a few of them said they were 
not sure whether they should keep on using the present tense, the past tense, 
or mix them in some way. Others were uncertain about the choice between 
simple or progressive aspect. In general, learners had problems describing 
expressions with inherent meaning of continuity (e.g. had been keeping a 
frog), simultaneity (e.g. meanwhile the dog was playing), and abruptness (e.g. 
out flew an owl from the hole). A few learners had difficulty selecting the 
correct preposition. E.g. look for vs. look into; bark on vs. bark at ; a hole 
in vs. a hole on the ground. One EFL learmer was concerned about the use 
of articles in his speech. Three learners who majored in German, Italian, 
and Portuguese, respectively, said they were aware of cross-linguistic 
influence of word choice and grammar from the language they majored in. 
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A few learners also hesitated on whether to use active or passive voice to 
express an event described in a scene. 

The choice of verb tense and aspect as well as that of sentence voice is 
also related to discourse strategies or effective story telling techniques. One 
ESL learner was concerned with his use of pronouns to refer to the boy, dog, 
and frog that appear in the story. He decides to use “he” for all of them, but 
says he wasn’t sure throughout the task. One EFL learner chose to name all 
the main characters. Others did not give much thought to calling them “he” 
initially, but later became aware that the use of “he” for all characters could 
be confusing for the listeners of their narratives. 

Other discourse strategies reported included trying not to restrict oneself 
to a mere description of scenes, but instead attempting to “tell a story”; 
trying to avoid using the same verb expression over and over because it 
sounds monotonous; and making an intentional selection between which part 
of the picture to describe and which part to omit in order to stick to the main 
storyline. Difficulties were expressed over how to effectively describe the 
boy’s emotional change as the story developed, and how to connect the 
individual descriptions of scenes into a coherent whole. A number of 
learners utilized the title of the picture book, “Frog where are you,” which 
were actually the only words printed in the entire book, on the cover page. 
Some said the title served as a useful line for the boy to say to make the 
narrative sound more story-like. One learner pointed out that he used the 
line repeatedly to earn some planning time to describe the next scene. 
Another said she was confident of using the line because it was in the title. 
Most learners used the line more than once in their speech. In a sense, the 
title functioned as a given formulaic sequence to many of the subjects. 

One ESL learner commented she had fun telling the story because she 
used to tell stories a lot when she lived in the U.S. during elementary school, 
but a few EFL learners were totally embarrassed to realize that they were not 
capable of telling a simple story effectively in English. Still others had 
trouble interpreting the meaning of the pictures sometimes, such as figuring 
out what the animal in the picture was supposed to be, or judging how the 
boy felt from his facial expressions, or which one of the frogs appearing in a 
scene was the boy’s pet, etc. Finally, several learners commented on how to 
begin and end a narrative. “Once upon a time, there lived.....” was used by 
three learners, two ESL learners and one EFL learner. One of the ESL 
learners commented that “since most kid stories seem to end with a phrase 
like, ‘and he said bye and went home,’ I decided to end the story that way.” 
Two EFL learners used an unconventional expression, “That’s all” to end the 
narrative. Several added “the end” to mark the end of the narrative. 
Others expressed doubts about how they should end a story. And one EFL 
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learner said he was determined to use “and they lived happily ever after” 
right from the beginning. 


6. Discussion 
6.1. Answers to research questions and hypotheses 

Hypothesis I: (Advanced learners use more phrasal verbs than less 
advanced learners but the relative frequency of phrasal verbs is lower 
compared to that of native speakers) was only partially supported. The 
frequency of phrasal verbs was indeed lower for learners compared to native 
speakers, but there was no significant difference between the higher 
proficiency group and the lower proficiency group of learners. The analysis 
of phrasal verbs used did suggest that ESL learners use different types of 
phrasal verbs compared to EFL leamers, a point I will return to in discussing 
Hypothesis II below. 

Hypothesis II: (Learners tend to use semantically transparent phrasal 
verbs more than semantically opaque phrasal verbs) was supported. 
However, subjects including native speakers used more semantically 
transparent phrasal verbs largely due to the nature of the storyline to be 
described. Hence, the hypothesis cannot be regarded as fully supported. 

Hypothesis II: (The structure of the phrasal verbs that are used by the 
learners are affected by the difference that exists between the way English 
and Japanese realize phrasal verbs depicting motion with manner) was 
confirmed. This tendency was observed more clearly in the EFL learners’ 
selection of phrasal verbs than ESL learners, in that the EFL learners tended 
to use more phrasal verbs that included main verbs with general meaning, 
while ESL learners used phrasal verbs that included verbs that had more 
specific meaning, sounding more English-like because they were 
lexicalizations of motion and manner combined. 

Hypothesis IV: (Although advanced ESL learners and EFL learners do 
not differ significantly in the word types and tokens they use in narrative 
speech, ESL learners use more phrasal verbs and more English-like idiomatic 
ones than EFL learners) was partially supported. It was found that ESL 
learners did not use phrasal verbs significantly more frequently than EFL 
learners. However, ESL learners did tend to use phrasal verbs in a more 
native-like way. 

Overall, results confirmed previous findings that phrasal verbs are 
difficult to use even for advanced learners of English. Although learners 
with experience of learning English naturally in an L2 context appear to have 
some advantage over those who do not, the difference is slight compared to 
the gap between the frequency of phrasal verbs used by native speakers and 
advanced learners in general. Furthermore, retrospective data revealed that 
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the learners were in many cases quite aware of their inability to use specific 
vocabulary items, including the exact phrasal verbs that would effectively 
describe events in a scene. Although they attempted to compensate for 
their lack of vocabulary through various forms of paraphrase, they often 
intentionally avoided using any expression at all or sometimes even gave up 
describing an entire scene. Thus, it seems safe to say that learners’ non-use 
of phrasal verbs was at least partially but clearly due to avoidance, not to a 
willing choice of non-use. Furthermore, the subjects seemed to be aware of 
some discourse strategies that would be effective in story telling, however, 
they did not possess enough grammatical knowledge nor formulaic 
sequences to actually implement many of such discourse strategies. 

The result presented here in itself does not address the cause of 
difficulty in using phrasal verbs by L2 learners. If input frequency is the 
major factor contributing to the acquisition of formulaic sequences, several 
years of residence in an L2 environment might not suffice. Obviously, 
more general words are more frequent in language input, so learners may not 
have been exposed to sufficient exemplars of a variety of phrasal verbs in 
context within several years of residence. Moreover, it has been pointed 
out that speech addressed to non-native speakers often includes less idioms 
in an attempt to modify the input (Irujo 1986). Overall, the findings 
suggest that even in cases of relatively advanced adult learners of English, 
learners have much difficulty in using the most appropriate expressions, 
including various formulaic sequences, without resorting to paraphrases. 
This result is in line with the general finding reported in vocabulary 
acquisition, that learners have an overwhelming tendency to rely on general, 
superordinate terms compared to native speakers who mostly use more 
specific terms with more restrictions of meaning and use (Laufer 1990). 


6.2. Limitations of the present study 

There are obviously numerous limitations to the present research. 
Here I only point to some of the major ones. With regards to the 
methodology adopted, the number of subjects was small and uneven, and no 
tests were given to check the learners’ comprehension knowledge of phrasal 
verbs. Non-use is not equivalent to avoidance or non-acquisition. 
Although the retrospective data proved to some extent that some non-use 
was due to avoidance, such interpretation does not apply to all cases. 
Clearly, just examining L2 use by a limited number of learners at a particular 
time in a particular task does not tell us much about the acquisition process 
of the expressions used or unused. 

The choice of using a story telling task presented advantages and 
disadvantages. The advantages were that the content of the story could be 


Presented by: https://jafrilibrary.com 


Presented by: https://jafrilibrary.com 


216 Asako YOSHITOMI 


controlled and that phrasal verbs of motion could be expected to appear in 
the data to enable an examination of the possible influence of Japanese 
phrasal verb structure on the use of phrasal verbs in English. At the same 
time, the choice of the particular story may not have been appropriate for 
observing non-transparent phrasal verb use. It is also true that telling a 
given story could be considerably more demanding for any speaker than 
talking freely about their own selves. In many ways the task was probably 
quite different from a BICS-type daily conversation. 

Further difficulties lie in the identification of phrasal verbs and their 
classification into distinct types. Some expressions are border-line cases 
especially when the same phrasal verbs have multiple meanings that differ in 
degree of semantic transparency depending on the context in which they are 
used. Although phrasal verbs are generally more identifiable than other 
types of discontinuous formulaic sequences with more flexibility and open 
slots, a reliable categorization is still a major issue. Because of the overall 
lack of internal and external validity of the present research, it is best 
considered as a preliminary study that should be further developed in a more 
rigorous way. 


6.3. Some considerations for future research 
6.3.1. Theoretical considerations 

Despite the fact that this study only addresses the use of phrasal verbs in 
narratives by advanced Japanese learners of English, my ultimate goal lies in 
contributing to a better understanding of what constitutes communicative 
competence of a mature speaker, how language is processed in real-time use, 
and how L2 learners develop such abilities. Weinert (1995) and Yorio 
(1989) rightly point out that formulaic sequences have received only 
marginal attention in linguistic and SLA theories. With a general 
recognition of the prevalence of formulaic sequences in language use, it is 
high time they should be incorporated more into such theories. 

Ever since the construct of communicative competence was postulated 
by Hymes (1972), attempts have been made to define what it is that truly 
constitutes one’s ability to communicate correctly and appropriately as a 
member of a society. The well-known classical model of communicative 
competence suggested by Canale and Swain (1980) states that 
communicative competence consists of grammatical competence, discourse 
competence, sociolinguistic competence and strategic competence. If the 
knowledge of formulaic sequences and the ability to use them appropriately 
are to be incorporated somehow in such kind of model, it would affect every 
subcomponent of the framework. Formulaic sequences are part of 
linguistic knowledge, with a position somewhere in between syntax and 
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lexicon. They contribute greatly to discourse and sociolinguistic/pragmatic 
competence. In addition, formulaic sequences function as production 
strategies to enhance fluency or communication strategies to compensate for 
the lack of linguistic knowledge, and therefore serve as a crucial element 
subsumed under strategic competence as well. Therefore, the incorporation 
of formulaic sequences into such a model of communicative competence is 
likely to have an overarching effect on the entire framework. 

Peters (1983) and Nattinger and DeCarrico (1992) claim that the role of 
formulaic sequences in language development is underestimated. Likewise, 
Weinert (1995) stresses the significance of further investigating the role of 
formulaic sequences in language learning. Skehan (1998) proposes a 
cognitive model of language processing and learning which suggests that 
speakers draw upon non-analytic knowledge in addition to analytic 
knowledge to deal with the primacy of meaning and the pressures of 
processing language in real time. In this model, language is believed to be 
processed by a dual-mode system, yet the exemplar-based system, namely, 
lexical means governed by the idiom principle, takes precedence, and 
speakers resort to the other rule-based system, or open-choice principle, only 
when they have to. The result is what we perceive as native-like fluency 
and native-like selection, or the capacity to sound idiomatic and natural in 
language use. Language development, according to this framework, takes 
place through a sequence of lexicalization, syntacticalization, and 
relexicalization. | Such theoretical development can be regarded as 
effectively incorporating the role of formulaic sequences in language use and 
development from a psycholinguistic point of view. A similar progress can 
be expected in a model that also addresses the sociolinguistic significance of 
formulaic sequences. 


6.3.2. Methodological considerations 

As stated in section 6.2, a major obstacle in conducting research on the 
acquisition of formulaic sequences lies in the nature of the formulaic 
sequences themselves. They cannot be defined in any absolute sense, or 
reliably extracted by a simple concordance analysis on linguistic corpora. 
On a positive note, Weinert (1995) states that the definition or criteria used to 
identify formulaic sequences would necessarily vary according to the focus 
of research but that there is considerable overlap across linguistic, 
psycholinguistic, and developmental criteria adopted among different studies. 
She also adds that if we accept that formulaic sequences could be explored 
based on a usage-based language theory, all criteria proposed would become 
relevant in investigating the units of language use and development. 

Vocabulary acquisition, in general, is a challenging area of exploration, 
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since vocabulary is an open system, not a rule-governed closed system like 
syntax. This does not mean that the acquisition of vocabulary is random. 
The relative ease with which some words are acquired and the difficulty 
experienced in acquiring others along with the similar lexical deficiencies 
observed in the learner language by learners of different backgrounds, 
suggest that vocabulary acquisition is governed by certain regularities 
(Laufer 1990). How such regularities could be clearly elucidated is still 
largely an unresolved problem. 

A further difficulty is caused by the general lack of spontaneous speech 
data by L2 learners that could be readily used for various research purposes. 
The time-consuming process of transcribing and coding spoken data, 
especially of L2 data full of uninterpretable utterances with deviant forms, 
has prevented the accumulation of corpora compared to L1 data or L2 
written data. Endeavors must be continued to build a larger reliable corpora 
of L2 oral data of learners with various backgrounds engaging in a variety of 
different tasks for future SLA research. 


6.3.3. Pedagogical considerations 

In the area of language pedagogy, assisting learners in acquiring 
communicative competence in the target language has become one of the 
major goals of language instruction. In a summary of the characteristics a 
communicative language teaching approach should have, Richard-Amato 
(1996) lists the following features: attaining communicative competence as a 
goal; implementing techniques that involve language practice with pragmatic, 
authentic, and meaningful purpose; developing both accuracy and fluency; 
providing opportunities for productive and receptive use of language in 
unrehearsed contexts; and emphasizing the acquisition of functional 
categories over grammatical rules. It is interesting to note that formulaic 
sequences seem to relate to the realization of all of these features. 

Despite the significance of formulaic sequences in language use and 
most likely also in language development, it has received relatively little 
attention in L2 teaching until recently. Since many formulaic sequences are 
idiomatic or have open slots with semantic restrictions, it is very difficult for 
L2 learners, especially foreign language learners to acquire them. Although 
so-called “phrase books” are quite common, there is much more to acquiring 
formulaic sequences than memorizing a number of practical set phrases for 
certain situations. Willis (1990), Nattinger and DeCarrico (1992), and 
Lewis (1993) are three representative attempts to foreground formulaic 
sequences into language teaching. For example, Lewis claims that the 
dichotomy of grammar and vocabulary is invalid, and that language should 
be conceived of being constituted of grammaticalized lexis instead of 
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lexicalized grammar. He also maintains that collocation is useful as an 
organizing principle. However, Wray points out after reviewing these three 
approaches that they are attempts “pursuing native-like linguistic usage by 
promoting entirely unnative-like processing behaviour” (2000; 463); that is, 
all three encourage learners and teachers to analyze multi-word strings into 
their component parts for pattern recognition, yet native speakers use 
formulaic sequences specifically to bypass such analytical processing. She 
also mentions that introducing formulaic sequences into classroom teaching 
will not easily solve the problem of how to make L2 learners sound 
idiomatic. 

Irujo (1993) and Yorio (1980) both emphasize the importance of 
exposing L2 learners to idioms as much as possible. Irujo (1986) claims 
that most materials for teaching idioms are inadequate and suggests some 
criteria for selecting idioms, such as frequency of use, transparency, 
appropriateness, simplicity of form and vocabulary, similarity to L1, and the 
construction of student-generated lists of idioms. She also emphasizes that 
teaching idioms should be an integral part of L2 vocabulary learning from 
the beginning level. 

Bahns and Eldaw (1993) and Farghal and Obiedat (1995) both focus on 
the importance of teaching collocation to L2 learners. Bahns and Eldaw 
(1993) studied German advanced EFL learners’ productive knowledge of 
English collocations, and found that the knowledge of collocations does not 
expand in parallel with knowledge of general vocabulary. They maintain 
that collocations must be acquired to attain full mastery of communicative 
competence in English, and propose that EFL teaching should emphasize 
teaching collocations, especially those which cannot readily be paraphrased. 
Fargahl and Obiedat (1995) studied Arabic learners of English and found that 
non-native EFL teachers as well as advanced learners were seriously 
deficient in their knowledge of collocations. They suggest that collocations 
should be taught on a par with idioms, and insist that it is essential to pay due 
respect to instruction based on the idiom principle in addition to the 
traditional paradigm based on the open-choice principle. Similarly, Wray 
(1992) claims that we should move away from a simple analytic/synthetic 
dichotomy towards a more integrated view of language processing, and 
Weinert (1995) suggests that language needs to be regarded as a 
formulaic-creative continuum. 

Lexicology has long been aware of the prevalence of formulaic 
sequences in native language use. However, only recently has 
English-Japanese dictionaries begun to consider differences in the realization 
of manner-of-motion phrasal verbs between English and Japanese 
(Yoshitomi 2005) or have attempted to set up a separate column for the 
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introduction of fixed as well as semi-fixed formulaic sequences with open 
slots, that do not necessarily file into the traditional categories of 
collocations, idioms or phrasal verbs (Yoshitomi 2002). If learners were to 
acquire the knowledge of formulaic sequences with the help of dictionaries, 
layouts and designs of dictionary pages that present relevant information in 
an explicit and user-friendly way should be experimented. At the same 
time, giving guidance on how to effectively use dictionaries to learn 
formulaic sequences should be emphasized more in language instruction. 

In English language classrooms in Japan, and undoubtedly in many 
other countries, instruction tends to emphasize rote memory of single 
vocabulary items without much instruction on how vocabulary can be 
learned effectively. At the same time, instruction often forces learners to 
memorize a large amount of complete sequences without necessarily making 
the learners aware of the compositional patterns of the sequences. These 
sequences tend to be formally constrained, complex, situationally 
unconstrained, and controlled by the teacher, when in fact, they should be 
chunks of varying sizes but mainly below clause size, contextually 
constrained, selected at least partially by the learner, and involve some form 
of pattern analysis (Weinert 1995). 

Recent interests in interlanguage pragmatics suggest that conscious 
learning of the pragmatic aspects of language is possible and effective 
through explicit instruction (Kasper 2001). Since formulaic sequences 
seem to be closely linked to social situations and contribute considerably to 
pragmatic effectiveness in communication, an attempt to combine the 
instruction of formulaic sequences with instruction aimed to facilitate 
pragmatic competence could be promising as well. 

In the area of L2 testing, too, it would be essential to include an 
assessment of the learners’ ability to use formulaic sequences appropriately 
in communicative situations. So-called “advanced” foreign language 
learners, including the subjects in the present study, are generally assessed as 
advanced based primarily on their knowledge of syntax and general 
vocabulary. More weight may need to be put on the learners’ ability to use 
formulaic sequences such as idioms, collocations, and phrasal verbs to arrive 
at a more accurate evaluation of the learners’ proficiency level (Yorio 1989). 

One final line of research I would like to suggest is the development of 
L2 maintenance programs for returnees which take formulaic sequences into 
account. Japanese returnees usually refer to Japanese learners of English 
who have spent two or more years in an English-speaking country before 
puberty and have acquired L2 naturally through abundant opportunities to 
receive L2 input and experience L2 interaction. Since it is likely that such 
returnees have an advantage over non-returnees concerning their knowledge 
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of formulaic sequences, it would be quite effective to stress its importance in 
the language programs that aim to maintain and enhance their L2 skills after 
they return to Japan. So far, I am unaware of any L2 maintenance program 
for returnees with such focus in mind. Studies on the attrition and 
re-learning process of formulaic sequences have been scarce (Yoshitomi 
1994). Although there are a number of studies on aphasics (Wray 2002), it 
would be intriguing to examine the nature of the attrition process of 
formulaic sequences by non-clinical cases of L2 learners, and ultimately 
apply the findings to improve the language maintenance programs in Japan. 


7. Conclusion 

The present study examined the use of phrasal verbs by advanced EFL 
and ESL learners in a story telling task. Despite a host of limitations of the 
study, it confirmed findings in previous literature that formulaic sequences, 
even relatively identifiable ones like phrasal verbs, are difficult to acquire. 
Nonetheless, the ubiquity of formulaic sequences in natural language and 
their social relevance in communication overall imply that the role of 
formulaic sequences should be given greater prominence in linguistic and 
SLA theories. Although methodological difficulties present a huge obstacle 
for conducting rigorous research, a steady advance has been made during 
this decade thanks to the technological progress in computer-based analyses 
of large-scale corpora and the slow but sound accumulation of small-scale 
investigations, such as the present one. Effective approaches to teaching 
formulaic sequences in L2 instruction will need further exploration. 
However, the importance of putting due weight on teaching formulaic 
sequences per se seems now to be gaining the acknowledgement it deserves. 
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Learning a Second Language through 
Audiovisual Media — A Longitudinal 
Investigation of Strategy Use and 
Development — 


Tae UMINO 


1. Introduction 

This paper aims to uncover the strategies that learners use in learning a 
second language (L2) through audiovisual media’. Audiovisual comprehension 
ability or the ability to make sense of the content provided via audiovisual 
media is an essential skill for learners who attempt to learn an L2 by using 
any kind of multimedia’ resource. It becomes particularly important in 
modes in which learners attempt to learn an L2 by using such resources 
without receiving direct instructions from a teacher or a tutor, such as in 
self-access® and self-instruction’. When learners work in these modes, their 
ability to comprehend the content of the materials and to derive the utmost 
benefit from the media characteristics affects the learning outcomes more 
directly than in the case of classroom learning. Thus, in order to provide 
appropriate support to self-accessing and self-instructing learners, we need to 
identify the potential problems that learners face in learning with such 
materials in these modes, the potential strategies for overcoming these 


1 Beene ree ae 
Here, the term “audiovisual media” is used broadly to refer to means of communication 


transmitted to both the sense of hearing and the sense of sight through technologies. 

Here, the term “multimedia” is used in a general sense to refer to the integration or 
combination of more than one medium—including text, graphics, animation, audio, and 
video—with or without the use of computers. 

Self-access has a wide range of definitions, varying from ways of organizing the 
materials (Sheerin 1991) to the whole environment involving learners and teachers as 
well as resources. Here, I define self-access based on Gardner and Miller’s (1999) 
definition: it is a mode of learning in which learners learn by interacting with the 
environment involving resources, teachers, and learners, organized with the aim of 
enabling learners to work in their own unique ways. 

Self-instruction is a mode of learning in which individuals take the initiative to learn 
without receiving control from institutionalized education (Umino 2005). Dickinson 
(1987) points out that in most cases, learners are only allowed to self-instruct by relying 
on a particular set of materials; he refers to such a mode of learning as “material-centered 
self-instruction.” 
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problems, and the potential strategies for deriving the utmost benefit from 
the media characteristics for their learning. However, despite the amount of 
attention and effort directed toward the development of such materials to be 
used in self-access/self-instruction, there exist very few studies that address 
the extent to which learners comprehend the content provided in the 
materials and/or their actions in the attempt to learn effectively from the 
materials in these modes. 

The present paper addresses the above issues based on a longitudinal 
study that investigates a group of adult learners who study Japanese with a 
self-instructional package comprising videos and textbooks. The study 
tracked seven learners’ self-instructional process over 14 months, comprising 
four periods of data collection (see section 3 for details). Umino (2003) 
reports the results from the first period and identifies some problems that 
learners face, particularly in coping with the variety and amount of 
information provided via audiovisual media. In this paper, I report the results 
from the second period to discuss the strategies used for learning with 
audiovisual media, including those for overcoming the problems identified in 
Umino (2003) mentioned above. Further, I also report some noteworthy 
developments in the learners’ strategy use over the seven-week period, which 
were observed due to the longitudinal nature of the study. 

Based on these findings, we discuss (1) strategies that are useful for L2 
learning with audiovisual media in general, (2) the implications for materials 
development, and (3) the methodology for investigating learners’ strategies 
in a more general manner. 


2. Learners’ strategy use in L2 audiovisual comprehension 

Language learning strategies are “conscious thoughts and actions that 
learners take in order to achieve a learning goal” (Chamot 2004:14) (see 
Umino et al. 2004 for a summary of the definitions of L2 learning strategies). 
Considerable time has elapsed since the significance of strategies in L2 
acquisition research was recognized. In the early years, researchers 
investigated the strategies used by “good language learners” (Naiman, 
Frohlich, Stern, and Todesco 1978; Stern 1975) with the aim of building a 
basis on which to train unsuccessful learners to use the so-called “good” 
strategies. As a result, L2 learning strategies were classified and the 
relationships between the types of strategies and learner characteristics were 
clarified. More recently, however, researchers have come to understand that 
no single strategy is superior to others in its own right and that the value of a 
strategy can only be identified in relation to the context of its use, which 
comprises the purpose of learning and the nature of the task (Chamot 2004). 
In other words, we need to match the strategies that assist learning with the 
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contexts in which they are used. 

The focus of this paper is to describe strategies as used in the 
comprehension of audiovisual media in self-instructed learning. As 
compared with other skills, there are only a few works on listening strategies 
in general and even fewer works on viewing (Swaffar and Bacon 1993; 
Rubin 1994). Given below is a review of the works concerning audiovisual 
comprehension in a “transactional” mode (Brown and Yule 1983), that is, 
within one-way transmission of information rather than in an interactional or 
two-way exchange. 

The most widely adopted taxonomy for describing listening and 
viewing strategies is that of O’Malley and Chamot (as seen in O’ Malley et al. 
1989; Bacon 1992a, 1992b; Vandergrift 1996, 1997; Goh 1997, 1998), which 
divides strategies into cognitive, metacognitive, and social/affective types 
(see Table 2.1)°. Cognitive strategies have an operative or cognitive-processing 
function and involve “interacting with the material to be learned, 
manipulating the material mentally or physically, or applying a specific 
technique to a learning task.” Metacognitive strategies make use of the 
knowledge on cognitive processes and constitute an attempt to regulate 
language learning by using means such as planning, monitoring, and 
evaluation. Social strategies involve interacting with another person, whereas 
affective strategies involve using affective control; both are aimed at 
assisting learning. 


Table 2.1 Taxonomy of learning strategies in O’ Malley and Chamot (1990) 


Metacognitive strategies: These involve thinking about the learning process, planning for 
learning, and planning for managing and supervising the strategy use. 
Planning/Directed attention/Selective attention/Self-management/Self-monitoring/ 
Problem identification/Self-evaluation 


Cognitive strategies: These involve interacting with the material to be learned, 

manipulating the material mentally or physically, or applying a specific technique to a 

learning task. 
Repetition/Resourcing/Grouping/Note-taking/Deduction/Induction/Substitution/ 
Elaboration/Summarization/Translation/Transfer/Inferencing 


° Hsiao and Oxford (2002) claim that Oxford’s (1990) taxonomy comprising three indirect 
strategies (metacognitive, affective, and social) and three direct strategies (cognitive, 
memory, and compensation) is superior to other taxonomies. However, O’Malley and 
Chamot’s (1990) taxonomy is more substantial in the cognitive subcategories and is 
considered more suitable in accounting for the task of transactional audiovisual 
comprehension. 
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Social/affective strategies 

Social strategies: These involve interacting with another person to assist learning. 
Questioning for clarification/Cooperation 

Affective strategies: These involve using affective control to assist a learning task. 
Self-talk/Self-reinforcement 


The studies till date have concentrated on discovering strategies that are 
useful for effective listening rather than audiovisual comprehension. In 
general, metacognitive strategies of “comprehension monitoring” and 
“selective attention” and cognitive strategies of “elaboration,” “inferencing,” 
and “summarization” have been identified as the most important strategies 
for L2 listeners. Wolff (1987), Bacon (1992a), and O’ Malley et al. (1989), in 
studies dealing with audio materials, also suggest that the strategies 
involving “top-down processing” are more important for L2 listeners than 
“bottom-up” strategies, but that learners’ application of these strategies is 
affected by the text characteristics. Umino (1993), through a study dealing 
with video materials, suggests that as compared with proficient learners, less 
proficient learners rely more heavily on the nonlinguistic clues provided in 
the materials (in other words, they use top-down strategies) and that the 
visuals affect learners’ choice between the use of top-down and bottom-up 
strategies. Vandergrift (1996, 1997), on the other hand, claims that learners 
with higher proficiency use more “deep-processing” strategies than the 
lower-proficiency group, although these concepts require further clarification. 
Other studies refine the knowledge sources of elaboration and inferencing 
strategies (e.g., O’Malley et al. 1989; Vandergrift 1996, 1997; Umino 1993; 
Goh 1997). 

However, with the exception of Umino (1993, 2003), the above studies 
give limited attention to the role of visuals in listening. Vogely (1995, 1998) 
and Chien and Wei (1998) do include visuals (videos and pictures) in their 
materials, but do not discuss their roles in any depth. There is an excessive 
emphasis on the aural mode in listening as compared with visuals in general. 
Thus, there is a need for more research that takes into account the roles of 
visuals in comprehension and the manner in which they affect learners’ 
strategy use. 

In terms of the investigation methodology, the most popular means for 
investigating listening/viewing strategies is verbal reports (e.g., O’Malley et 
al. 1989; Bacon et al. 1990; Bacon 1992a, 1992b; Vandergrift 1996, 1997; 
Goh 1997, 1998; Umino 1993). Listening is one of the more difficult skills to 
investigate since it is not directly observable (Rubin 1994), and the studies 
confirm the advantages of the use of introspective methods. However, there 
is an excessive reliance on quantitative methods for data analysis, such as the 
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frequency counts of types of strategies reported by different types of learners 
or on different text types. As Vandergrift (1996, 1997) suggests, the 
qualitative analysis of learners’ reports may provide a greater insight into 
different aspects of strategy application, and this possibility should be 
pursued further. Furthermore, only a minority of the studies (O’Malley et al. 
1989; Umino 2003) employ a longitudinal approach, although some studies 
indicate that learners’ strategy use is amenable to change over time (Rubin 
1987; O’Malley and Chamot 1990). Whether or not this is the case also 
needs to be addressed further. 


3. The study 
3.1. Aim and setting 

I describe below a part of the study that investigates learners’ strategy 
use in the audiovisual comprehension of L2 lessons presented via video 
materials. The study as a whole was a longitudinal investigation of an L2 
self-instruction process in which seven British adult learners of Japanese 
were observed over 14 months. The learners were asked to gather once a 
week at the home of the researcher or of one of the learners to study using 
the materials; they were also interviewed at each meeting. In this way, the 
data were collected during four periods, which summed up to a total of 34 
weeks (see Table 3.1). In this paper, I report the results obtained from the 
second period of data collection. The participants were adults between 25 
and 59 years of age, living and working in London, the United Kingdom. Of 
the seven learners who participated in Period 1, four (Robert, Dale, Jane, and 
Sally) volunteered to take part in Period 2 and remained in their original 
groupings (Group A: Robert and Dale; Group B: Jane and Sally)°. Period 2 
comprised seven interview sessions conducted separately for the two groups, 
resulting in a total of 14 sessions (see Table 3.2). 


Table 3.1 Summary of the data collection sessions in the four periods of the study 


Period | Session | Time span | Lesson in the material’ Participants 

Period 1 1-11 11 weeks Lessons 2—12 Group A: Robert, Dale, and 
Sharon, 
Group B: Jane, Sally, Kris, 
and Anne 

Period 2 | 12-18 7 weeks Lessons 17—24 Group A: Robert and Dale 
Group B: Jane and Sally 

Period 3 | 19-26 8 weeks Lessons 27-34 Robert, Jane, and Sally 

Period 4 | 27-34 8 weeks Lessons 35-42 Robert, Jane, and Sally 


8 all participants are represented by their pseudonyms. 
In Period 1, Lesson 1 was shown as a sample. In Periods 2 and 3, Lessons 13-16 and 
Lesson 18, and Lessons 25—26 were skipped respectively since the learners were already 
familiar with the target items in these lessons. 
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Table 3.2. Overview of the data collection sessions that occurred during Period 2 


Week| Session | Lesson Target items in the lesson Participants 
1 12 17 _|N ga V-tai (want to —) Group A: Robert and Dale 
(practice) N ga suki/kirai (like/dislike) Group B: Jane and Sally 
2 13 19 | V deshou (will probably —) Group A: Robert 
— mada V-masen (Have/have not | Group B: Jane and Sally 
yet) 
3 14 20 |— kamo shiremasen (— may —) Group A: Robert and Dale 
— sou (— looks —) Group B: Jane and Sally 
4 15 21 |N ga dekimasu (can do —) Group A: Robert and Dale 
Group B: Jane and Sally 
5 16 22 | V-(rar)emasu (Can —/be able to —) | Group A: Robert 
Group B: Jane and Sally 
6 17 23 | Doushite — ka. (Why —?)/— kara| Group A: Robert and Dale 
desu. (Because —) Group B: Jane and Sally 


Doushita n deshou ka (I wonder 
what happened to —.) 

7 18 24 |—kara V-te kudasai (—, so please —.) | Group A: Robert and Dale 
Group B: Jane and Sally 


The self-instruction material that learners used as assistance was the 
“Let’s Learn Japanese” (LLJ) series (The Japan Foundation, 1996) 
comprising a set of videotapes and textbooks. Each lesson is 30 minutes long 
and contains an ongoing drama with the aim of presenting the Japanese 
language in its naturalistic form (that is, in the natural speed and as used in 
naturalistic settings). The drama forms the core of each program and includes 
a large part of the linguistic items and cultural aspects presented (Sakata 
1991). Table 3.2 lists the target items covered in the material for this period. 

The research questions set for this period were as follows: 

(1) What strategies do learners use in learning through lessons 
presented via audiovisual media? 

(2) Are there any developmental changes in the manner in which 
learners study over the time period? 

Also, at the end of Period 1, the learners reported the problems that they 
encountered in learning with the audiovisual self-instruction materials, 
particularly in coping with the variety and amount of information provided 
via the medium (reported in Umino 2003). Thus, the study also aimed to 
identify the strategies that learners developed in order to cope with this 
problem. 


3.2. Procedure for data collection and analysis 


For the interview sessions in Period 2, the following procedure was 
employed in order to obtain learners’ reports on strategy use while they are 
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viewing the lessons presented via the video materials. The videotape was 
paused at several points during the lesson, and the learners were asked to 
report their thought processes. The points at which the tape was paused 
corresponded to the constituents of each lesson, which were (1) the showing 
of a naturalistic drama to contextualize the new learning items (for 
odd-numbered lessons) or review (for even-numbered lessons), (2) the 
presentation of new linguistic items, (3) the showing of example skits and 
explanations, (4) writing and pronunciation instructions, and (5) the showing 
of the portion of the drama for consolidation. The first session (Session 12) 
was used as a practice session to ensure that the participants understood the 
procedure. At the conclusion of the seven sessions, I interviewed each 
individual learner for the last time in order to obtain the learners’ overall 
reflections on the seven sessions. All the interviews were conducted in 
English —the participants’ mother tongue—and were recorded on audiotapes 
and transcribed for analysis. Thus, another aim of the study was to observe 
learners’ strategy use according to the instructional sequence. 

Using the transcribed data, I identified the units in the learners’ 
responses that represented a particular strategy. Then, I assigned to each unit 
either a strategy category identified in the previous works or a new category; 
I represented each category by a keyword appearing in the learners’ words or 
adapted categories from the previous works after revising their definitions. In 
this way, I developed a list of strategy categories. In order to ensure 
intra-rater consistency, after a one-week interval, I repeated the procedure 
with new transcripts. I subsequently compared the first and the second 
coding and resolved the discrepancies, and then determined a final set of 
categories. I also had an independent rater cross-check the coding with a 
selected transcript. There were no differences in our coding of this transcript. 
The strategies identified in this way were then grouped into three 
superordinate categories: metacognitive, cognitive, and affective strategies 
(see Table 4.1). 


4. Results 
4.1. Types of strategies used for audiovisual comprehension 
Table 4.1 lists the 20 strategy types identified according to the basic 
distinction of metacognitive, cognitive, and affective strategies (O’ Malley 
and Chamot 1990). Since the task was transactional viewing, no social 
strategy was reported. Four cognitive strategy categories—warming up, 
recalling, summarizing, and developing a question—were newly developed. 
If we consider the range of strategies identified, we see that those 
identified as commonly used listening strategies in the previous studies 
(reviewed in section 2 above) are also used in audiovisual comprehension, 
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namely, inferencing, translating, inducing, note-taking, resourcing, selective 
attention, and self-monitoring. Furthermore, the list also includes strategies 
that received little or no attention in the previous works but are important in 
attending to an instructional sequence within a lesson. For example, warming 
up, recalling, predicting, and developing a question are used at the beginning 
of a lesson in order to anticipate the upcoming lesson, whereas summarizing 
and absorbing are used at the end of a lesson. These are strategies that 
become important when participating in a lesson; however, they were not 
highlighted in the previous works, in which strategy research was carried out 
in decontextualized settings (such as merely listening to a passage on tape). 
This indicates the need to further investigate the types of strategies used in 
actual instructional settings in order to understand the strategies that are 
found to be useful in instructed learning settings. In addition, strategies that 
are unique to learning with naturalistic drama material were also reported. 
For example, warming up (tuning one’s ears in order to be prepared for the 
task of listening to naturalistic speech) was necessary for those learners who 
had no exposure to Japanese in their daily lives. Getting the gist and 
inferencing were also necessary when viewing the naturalistic skit for the 
first time since it contained many linguistic items that were new to the 
learners. The two affective strategies, tolerating ambiguity (not worrying 
about the uncertainty in relation to the content of the material) and lowering 
anxiety (attempting not to panic), also related to dealing with naturalistic 
speech’. 


Table 4.1 Reported strategies for audiovisual comprehension 


Metacognitive strategies 

1 Selective attention: Deciding in advance to attend to specific aspects of language 
input or situational details that assist in the performance of a task; attending to specific 
aspects of language input during task execution 

2 Self-monitoring: Checking, verifying, or correcting one’s comprehension, performance, 
or use of learned or self-developed rules in the course of a language task 

Cognitive strategies 

3 Translating: Rendering ideas from one language to another in a relatively verbatim 
manner 

4 Inferencing: Surmising the meanings or usage of unfamiliar language items associated 
with a language task in order to fill in the missing information by using the available 
information 


8 These two strategies were adopted from Naiman, Frohlich, Stern, and Todesco’s (1978) 
original work. 
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5 Inducing: Consciously looking for and/or developing a hypothetical rule from the 
given input material 

6 Summarizing: Making a mental or written summary of the language and information 
presented in a task 

7 Note-taking: Writing down key words and concepts in abbreviated verbal, graphic, or 
numerical forms in order to assist the performance of a language task 

8 Resourcing: Using the available reference sources of information on the target 
language, including dictionaries, textbooks, and previous works 

9 Elaborating: Relating different parts of the new information; making meaningful 
personal associations with the information presented 

10 Repeating: Repeating a chunk of language (a word or phrase) in the course of 
performing a language task 

11 Transfer: Using previously acquired linguistic knowledge to facilitate a language task 

12 Predicting: Making predictions regarding language content that learners have not yet 
come across 

13 Getting the gist: Understanding the general idea of a material rather than a specific 
item 

14 Decomposing: Deconstructing the sequence into chunks so as to understand the 
structure of a language item 

15 Warming-up: Tuning one’s ears so as to be prepared for a language task 

16 Recalling: Remembering what has been learned previously 

17 Absorbing: Simply listening to or reading the learned material in order to absorb it 

18 Developing a question: Developing a question and attending to the material bearing 
the question in mind 

Affective strategies 

19 Tolerating ambiguity: Making a deliberate effort not to worry about the uncertainty in 
relation to the content of the material 

20 Lowering anxiety: Reducing anxiety by using mental techniques that make one feel 


competent to carry out the learning task 


4.2, Developmental changes in strategy use 

If we examine the developmental changes in the learners’ strategy use 
over the period, we can observe some developments in their use of certain 
strategies. In the final interviews at the end of the seven sessions, the learners 
reported three types of changes: changes in the frequency of use, the manner 
of use, and the combination in which they use certain strategies. Further, 
these changes appear to result from the learners’ attempts to overcome the 
problem that they encounter in learning with this medium, that is, coping 
with information overload. 
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4.2.1. Change in the frequency and manner of strategy use 

In terms of the frequency of strategy use, the learners reported that they 
began to make more frequent use of selective attention—a strategy to deal 
with selecting a particular item to focus on at a time. It is a useful strategy 
for coping with the variety and amount of information provided via the video. 
In the extract given below, Robert describes how he coped with this specific 
problem by selectively focussing on different aspects of the input each time. 


Extract 4-1 

Robert: The first time, I focus on each word and how it is written in Roma-ji’ in order to 
check if I can identify whether it is a question, which tense, etc. Then, I look at the 
sentence itself to see if there is any pattern emerging through the structure of the 
sentence that will help me at a later stage. For example, ka is a question marker. 


On the other hand, the learners reported changing the manner of using 
the resourcing and note-taking strategies. Resourcing in this context refers to 
using the available reference sources such as textbooks, learners’ notes, and 
dictionaries. This may interfere with the comprehension of the lessons if 
carried out while viewing the video. However, as Jane reports below, the 
learners began to refer to the other materials prior to viewing the video so 
that they could focus on the lessons. 


Extract 4-2 

Jane: I also make sure that I read the notes [textbook] every week before coming here so 
that I know what’s coming. Otherwise, I find myself always looking at the notes as 
I watch the programs. I find this practice quite useful. Rather than being glued to 
the notes during the actual program and missing parts of it, if I look at the notes 
beforehand, I can pay more attention to the program while it’s going on. Further, 
it’s a bit easier to understand the program if you have an idea of what’s coming. 


With regard to note-taking, the learners reported using this strategy to a 
lesser extent in Period 2 as compared with Period 1; however, they also 
reported that the quality of their notes had improved. As Lindsay and 
Norman (1977) point out, note-taking is a decision to focus away from the 
text and to focus on the act of writing. Thus, in order that learners can 
achieve a compromise between the need to attend to the video and the need 
to record selected information, they need to somehow minimize the time that 
they spend focussing on writing. The analysis of the actual notes that the 
learners took confirms the decrease in the quantity and the change in the 


9 a i : 
Roma-ji is a method of writing Japanese in Roman letters. 
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quality of their notes. Tables 4.2 and 4.3 show the number of note entries 
made by Jane, Robert, and Dale”’ in Periods 1 and 2 respectively. As shown 
in Table 4.2, the total number of note entries made during Period 1 was 163 
for eight lessons, with an average of 6.8 notes per lesson. All the three 
learners took extensive notes, but Jane in particular made 95 entries for eight 
lessons, with an average of 11.8 notes per lesson. In Period 2, however, the 
number of note entries was reduced to a total of 30 notes for five lessons, 
with an average of 2 notes per lesson. Jane took only 18 notes in total, with 
an average of 3.6 notes per lesson. 


Table 4.2. Number of note entries made during the video lessons in Period 1 


Lesson Jane Dale | Robert | Total 
Lesson 3 0 9 0 9 
Lesson 4 28 0 18 46 
Lesson 6 19 0 0 19 
Lesson 7 11 8 0 19 
Lesson 8 0 10 16 
Lesson 10 5 3 3 11 
Lesson 11 11 0 11 
Lesson 12 15 0 17 32 
Total 95 20 48 163 


Table 4.3 Number of note entries made during the video lessons in Period 2 


Lesson Jane Dale | Robert | Total 
Lesson 19 6 0 5 11 
Lesson 21 1 0 1 2 
Lesson 23 5 0 3 
Lesson 24 2 0 3 5 
Lesson 25 4 0 0 
Total 18 0 12 30 


Regarding the type of notes taken, at the beginning of Period 1, the 
learners simply wrote down a word or a sentence and its translation or the 
explanation for its use (such as “oishii—good tasting” or “Kono saki no 
migigawa—up ahead on the right”). However, as the sessions proceeded, the 
learners began to draw diagrams to summarize the content, questions (for 
example, questions on the pronunciation of the particle ga should be 
nasalized), and information to link a new item to the previously learned 
items. An example of the latter is in Lesson 23, when Jane wrote down the 


0 | could collect notes from only these three learners at the end of Period 2. 
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two sentence patterns “Doushite - desu ka” (Why —?) and “Doushita n desu 
ka” (What’s happened?) along with the newly learned pattern “Doushita n 
deshou ka” (What might have happened?) in order to consolidate her 
knowledge of the relevant expressions. The learners also made notes in 
relation to what was written in the textbook (with the use of arrows and 
highlights). Rost (1987) points out that such concept-ordering notes (for 
example, sequencing and relating items) and focussing notes (highlighting or 
dehighlighting) are regarded as more efficient than the mere verbatim 
transcribing or copying of the speaker’s utterances. In this light, the learners 
became more efficient at taking notes, which allowed them to pay more 
attention to the video itself. 


4.2.2. Combination of strategies 

The third noteworthy change in relation to the learners’ strategy use is 
the development of fixed routines of a combination of several strategies. In 
particular, the three strategies of selective attention, resourcing, and 
note-taking discussed above were combined effectively to cope with the 
problem of information overload. Such combinations eventually formed a 
routine and were used repeatedly. For example, Dale developed a routine as 
described in Extract 4-3. 


Extract 4-3 

Dale: I’ve developed a process in which, when I’m going through the skits, I first look 
and listen. I then try and match it up with what’s in here (the textbook). I still get 
stuck on words occasionally. I’m still not too heavily focussed on sentence 
construction. I tend to be more focussed on the meaning, the individual words. 
Further, I still try to literally translate and compare the translation with the 
meanings in the script and the translation that it contains, and try and identify the 
point of difference and why it exists. I find I’m doing that more and more. And at 
times, I’m switching from.... I’m not relying so heavily on the visuals now. The 
first time I see the skit, I probably watch it and rely heavily on the visuals. The 
second time the skit comes up, I concentrate on the visuals, try to pick up things 
that are very familiar to me, and identify from the skit exactly what’s going on to 
fill in the gaps. Revisiting it, I go through it again and try to put it all in place. So 
now, I’m much more structured in the way I’m doing it. And I can see myself doing 
it now, whereas before, I hadn’t thought about the way I was learning. I just sat 
there. However, the information happened to sink in. So I had no particular pattern 
that I was conscious of. 


To summarize Dale’s method, he first attempts to get the gist of the 
proceedings when viewing the long skit for the first time (getting the gist). 
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Then, in the second viewing, he selectively attends to the particular 
feature(s) of the language (selective attention), using the textbook to match 
the language with the written script (resourcing). Next, he translates the 
Japanese into English (translating) and compares his translation with that 
provided in the textbook (resourcing) in order to identify the difference. With 
this fixed routine, Dale’s strategy use became more efficient, leaving more 
room for him to concentrate on the learning content than on the process itself 
(see Table 4.4 for a summary of this routine.) 


Table 4.4 Summary of Dale’s strategy use routine 


(1) First viewing of Yan’s skit: Get the gist of the skit, relying on the visuals. [getting the 
gist] 

(2) Second viewing: Concentrate on the language rather than the visuals. Look at the 
textbook and match the language with the script, focussing on the familiar words. 
[selective attention/resourcing] 

(3) Translate the Japanese into English. [translating] 

(4) Compare the translation with that given in the textbook. Identify the difference and 


the reason for it. [translating/resourcing] 


Similarly, Robert reports a specific method he developed for improving 
his pronunciation. Robert reads the scripts of the skits in the textbook 
beforehand. In Extract 4-4, Robert describes his use of the textbook material, 
which is different from that of Dale’s. 


Extract 4-4 

Robert: I read the material beforehand so that as the video plays, I will be able to mentally 
or physically cross-reference it. If a word comes up that is pronounced differently 
from what I think, I then try to look up that word and analyze my reasons for 
pronouncing it in that way. For example, for a long time, I thought that the word 
yoroshiku was pronounced as yoroshiku, but it’s actually yoroshku. It’s squashed a 
little bit. So I look at the written form twice and then listen to it again. This helps 
me with the pronunciation. I think it is particularly effective. I have to read first and 
try to pronounce the word in the manner in which I think it is pronounced. And 
very often it is completely different. 

Tae: Then what do you do? 

Robert: I then go back and work again on pronouncing the word until the correct way to 
pronounce it stays in my memory. And I just analyze why I was pronouncing it 
differently. 


Here, Robert first reads the script of the drama in the textbook at home 
[resourcing], predicting how the words might be pronounced (predicting). 
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When viewing the drama on the video, he pays attention to the pronunciation 
(selective attention). If the words are pronounced in a manner different from 
his predictions, he looks at the words in the script and attempts to analyze the 
reason for his incorrect prediction (self-monitoring/resourcing). He then 
practices pronouncing the words in the way they were pronounced 
(repeating) (see Table 4.5 for a summary of Robert’s routine). 


Table 4.5 Summary of Robert’s strategy use routine 


(1) Read the script of the skits in the textbook material at home. [resourcing] 

(2) Predict how the words in the script might be pronounced. [predicting] 

(3) Watch and listen to the skits, paying attention to the pronunciation. [selective 
attention] 

(4) If the words are pronounced differently from the prediction, listen to the 
pronunciation in the skits carefully, look at the words in the script, and attempt to 
analyze why the prediction was incorrect. [self-monitoring/resourcing] 

(5) Afterward, practice pronouncing the word in the way it was pronounced. [repeating ] 


The development of routines of a certain combination of strategies to 
meet learners’ particular needs is a noteworthy feature of the learners’ 
strategy use. This type of routinization in strategy use is an under-researched 
area in strategy research and should be investigated further in future 
research. 


5. Discussion 

Through the study, we found that the strategies commonly used for 
listening are also used for audiovisual comprehension. However, we also 
identified (1) some types of strategies that are useful for audiovisual 
comprehension in particular and (2) the ways in which these strategies are 
used by learners in order to benefit from this mode of learning. 

Of particular interest to us was the finding that the learners developed 
the use of certain strategies—in particular, selective attention, resourcing, 
and note-taking—in order to cope with the information overload 
characteristic to learning with audiovisual media. This issue can be discussed 
in light of attentional control, whose role in L2 acquisition research has been 
discussed more generally. The role of attention in L2 learning is based on the 
premise that learning is dependent on the learner’s active mental engagement 
with the incoming information. Schmidt’s (1990) “noticing hypothesis” 
holds that learners must first have a conscious awareness of a particular 
linguistic form before it can be processed. Tomlin and Villa (1994), on the 
other hand, argue that mere noticing is not more critical than other 
attentional processes, such as “orientation”’—which is “the direction of 
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attentional resources to a type or class of sensory information at the expense 
of others’—and “detection”—which is defined as “the process that selects, 
or engages, a particular and specific bit of information within a type” (p. 
185). The basic premise here is that L2 learners are always bombarded with 
overwhelming amounts and types of L2 input, and the attentional system is 
crucial to “sort out that input and to bring order to the chaos threatening to, 
and sometimes succeeding in, overwhelming the learner” (Tomlin and Villa 
1994:184). In the case of learning with multimedia materials, in particular, 
having control over the attentional process becomes more crucial than in the 
case of learning with single-medium materials (such as text-only or 
audio-only materials); this is due to the nature of the medium. Thus, learning 
how to control one’s attention is one area that learners need to develop in 
learning effectively with audiovisual materials. 

Second, we also found that the learners developed routines of a 
combination of strategies in coping with certain needs that they identified. 
The development of such routines or “routinization” is a notion that is 
understood in the context of skills acquisition in information processing 
models (as represented in the works of McLaughlin 1987; McLaughlin and 
Heredia 1996; Levelt 1978; Hulstijn 1990) as a means of improving the 
efficiency of learning. In this framework, acquiring task knowledge (Wenden 
1998), including the knowledge of language acquisition tasks, is regarded as 
an acquisition of certain skills and the integration of the skills necessary for 
the fluent performance of a task. Further, because human capacity is limited, 
a mechanism is assumed through which such integration of a number of 
different skills is practiced and automatized. The cognitive scientists 
Schneider and Shiffrin (Shiffrin and Schneider 1977; Schneider and Shiffrin 
1984) long ago identified two processing procedures in this process: 
controlled processing—which requires attentional control for the activation 
of informational elements stored in the long-term memory—and automatic 
processing—in which the activation of stored informational elements is a 
learned response that has been built up through the consistent mapping of the 
same input to the same pattern of activation over many trials. Once the 
controlled processing becomes routinized, the controlled processes are 
released to be allocated to higher levels of processing. In this way, the 
development of complex cognitive skills involves the building up of a set of 
well-learned, automatic procedures so that the controlled processes will be 
released for new tasks. Seen in this light, the routinization of strategy use is 
similarly necessary and important for improving the efficiency of strategy 
use. It is an aspect of strategy use that should be pursued further and in a 
more general manner in strategy research. 
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6. Conclusion and implications 

This paper discussed a longitudinal investigation into the self-instructional 
process of L2 adult learners learning from audiovisual media. Through the 
study, we identified the strategies that learners employ during audiovisual 
comprehension, although these strategies are still hypothetical and need 
further validation. The study as a whole shed light on the process of 
material-centred self-instruction (Dickinson 1987) occurring outside the 
teachers’ instructional control. However, obtaining a more complete picture 
of the self-instruction process would require the investigation of other 
self-instruction cases, using other types of materials, and in other learning 
contexts. 

As implications for pedagogy, this study draws attention to the need for 
including various types of support among the self-instructing learners’ 
learning materials in order to assist their learning. The learners in this study, 
who studied in foreign language learning contexts outside Japan, had 
minimum exposure to Japanese and were new to the audiovisual materials. 
Therefore, they initially faced difficulties in learning with this medium. 
Those learning under similar circumstances can be encouraged to use the 
strategies reported in this study. For example, learners who have a problem 
coping with information overload can be encouraged to pay attention to a 
particular feature of the input at a time, make effective use of the textbook 
before and after the lessons, and take brief but effective notes. Similarly, 
those who have little or no exposure to Japanese in their daily lives ought to 
be told not to panic if they do not understand everything that they hear; they 
should be encouraged to use strategies that are appropriate for the task at 
hand. Such insights can be incorporated as tips or guidelines among the 
learning materials as assistance in self-instruction". 

We can offer two additional suggestions from the methodological point 
of view. First, in this study, we collected learners’ “online” verbal report data 
while they were engaged in a lesson (provided via video). This enabled us to 
identify some new strategies that were not found in the previous works and 
the ways in which these strategies are put to use in relation to the structure of 
the lesson. The results suggest that the ways in which strategies are used in 
actual learning settings may differ from those found in arbitrary task settings 
set for data collection, as employed in the previous works. This implies the 
need for considering contextual factors in a more general manner while 


1 As an example of such assistance, I am currently developing a “learner’s guide” for the 
Web-based Japanese learning material “TUFS Dialogue Module” (The 21* Century COE 
Program “Usage-Based Linguistic Informatics,” Graduate School of Area and Culture 
Studies, Tokyo University of Foreign Studies), which includes the use of strategies as 
suggested in this study. 
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investigating strategies in instructed learning. We must be aware that this 
way of data collection was made possible due to the characteristic of the 
medium being used, namely, the ability to pause the lesson at any point that 
we wish; this would have been more difficult in some other instructional 
settings. However, even in the case of classroom learning, for example, it is 
still possible to employ methods such as stimulated recall procedures in 
order to elicit learners’ strategy use during a lesson, or to combine the prior 
analysis of instructional sequences with retrospective interviews in order to 
elicit descriptions of chronological strategy use in relation to the 
instructional sequence. These points ought to be considered in future 
research. 

Second, the study investigated learners’ strategy use in a longitudinal 
fashion. This allowed us to observe how learners eventually developed the 
routinization of strategy use through repeated interactions with the materials. 
Such development is an interesting aspect of strategy application worthy of 
further investigation and has particular implications for strategy instruction. 
However, this has not been highlighted in the previous works, which relied 
on cross-sectional methodology. The above suggests the need for more 
longitudinal approaches in order to obtain a better understanding of how 
more effective uses of strategies are developed among individual learners. 
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Network-Based Computer Assisted 
Language Learning (CALL) 
— Emergent Research Issues — 


Mark PETERSON 


This article examines the application and impact of network computer 
technologies in the field of CALL. The rapid expansion in the use of 
network-based tools in language education brings forth new opportunities to 
engage learners in activities that foster L2 development. This discussion 
will examine the application of both synchronous and asynchronous 
communication tools in learner-based projects in order to identify the aspects 
of computer-mediated communication (CMC) that may have a role to play in 
supporting second language development in the online domain. The final 
section of this chapter will focus on identifying areas of potential for future 
research. 


1. Introduction 

Two main types of network-based tool have been implemented in 
CALL projects. Asynchronous tools such as e-mail and electronic bulletin 
boards have been utilized in a number of studies (Barson et al., 1993; Kern, 
1996; Soh & Soon, 1991). These tools enable target language (TL) 
communication to take place via text over both short and long periods of 
time. Synchronous environments such as LAN-based conferencing tools, 
videoconferencing, IRC chat, MOOs (multi-user object-orientated domains) 
and 3D virtual worlds facilitate real-time interaction in the target language 
(Buckett et al., 1999; Chun, 1994; Hudson & Bruckman, 2002; Kelm, 1992; 
Kern, 1995; Pelletteri, 2000; Von Der Emde et al., 2001; Toyoda & Harrison, 
2002; Tudini, 2003). CALL projects utilizing both asynchronous and 
synchronous communication tools have reported encouraging results (Ortega, 
1997;Peterson, 1997; Sotillio, 2000). This paper will examine a number of 
projects that have utilized these tools in an attempt to facilitate second 
language learning. This paper also attempts to identify factors that may have 
a significant positive influence on learners’ interlanguage development in 
network environments. 
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2. Asynchronous communication tools in CALL 

A number of learner-based projects have attempted to investigate the 
potential of asynchronous network communication tools in CALL. An early 
project described by Soh & Soon (1991) reported on the bulletin board 
system (BBS)-based interaction of two groups of EFL learners based in 
Singapore and Quebec. The researchers reported that participation led to 
improved discourse management skills as the learners created their own 
tasks and took responsibility for their leaning. This research also reported 
evidence for enhanced critical thinking skills and improvement in fluency 
that was the result of the TL learner-centered nature of the interaction. A 
study by Barson et al., (1993) explored the use of e-mail exchange in a 
project involving learners based at three universities. This research 
highlighted the collaborative nature of the interaction. These researchers 
reported that the students communicated using complex L2 structures and 
that this phenomenon was the result of the motivation effects of authentic 
interaction with peers and writing for an audience. The results of a project 
based on learner e-mail exchange involving non-native speakers (NNSs) of 
French based in the United States and native speakers (NSs) of French 
(located in France) were reported in Kern (1996). Kern’s study highlighted 
the beneficial effects of online interaction on learners’ knowledge of the 
target culture. Research conducted into the use of e-mail in a Spanish 
language undergraduate course reported by Gonzalez-Bueno (1998) 
produced a number of positive results. The researcher found that the use of 
e-mail stimulated increased TL output, improvements in the quality of the L2 
produced and a high level of accuracy (as the delayed nature of the 
interaction provided for increased planning time). This study also claimed 
that during the interaction the subjects produced a wide variety of language 
topics and functions. A comparative study conducted by Warschauer (1996) 
examined the attitudes to participation in e-mail-based exchanges by 
language learners located in three countries. The results showed high levels 
of motivation engendered by the international nature of the communication 
with other learners. This study emphasized the sense of empowerment and 
enjoyment made possible by the learner-centered nature of the interaction. 
This author highlights the fact that these benefits were enhanced when the 
e-mail-based interaction was integrated into the curriculum in a focused 
coherent manner. The positive results of the above studies demonstrate the 
potential of asynchronous communication tools in CALL. These benefits are 
summarized in table 1. 
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Table 1. Benefits of asynchronous communication in CALL 


Opportunities for TL interaction 


Increased planning time (as learners do not need to provide an immediate response) 


The production of a wide variety of topics and functions 


Enhanced motivation (caused by interaction with peers) 


Improved accuracy (the result of increased planning time and feedback) 


Improved discourse management skills 


Development of cross-cultural understanding and knowledge of the target language 
culture (in international projects) 


Enhanced critical thinking skills 


Collaborative learning 


Fosters awareness of writing for an audience 


Enjoyment 


Researchers have also attempted to utilize synchronous communications 
tools in CALL. The discussion will now examine the results of studies based 
on the application of these tools in a number of language learning projects. 


3. Synchronous environments in CALL 

Early attempts to utilize synchronous (real time) network technologies in 
CALL centered on the application of LAN-based writing environments. A 
well-known example of this type of conferencing technology is the Daedalus 
writing environment (http://www.daedalus.com/) originally developed for the 
English program at the University of Texas. This writing tool was designed 
to provide a collaborative learning environment. Daedalus incorporates a 
conferencing tool and heuristics for prewriting and topic development. An 
early study of native speaker (NS) interaction in Daedalus by Bump (1990) 
found evidence of increased written output. This researcher further observed 
the emergence of collaborative learning behaviors. Bump speculated that 
these behaviors were the result of the reduced turn-taking pressures inherent 
in online communication and the egalitarian nature of much online discourse. 
A study of Portuguese language students interaction by Beauvois (1992) 
found that the anonymity afford by synchronous discussion resulted in 
increased learner participation, motivation and TL output. Following on this 
work, a study of non-native speaker (NNS) interaction by Chun (1994) found 
that participation in a Daedalus-based class discussion project fostered the 
development of communicative competence on the part of learners. The 
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subjects in this study initiated and sustained TL discourse while at the same 
time expanding their linguistic competency. A comparative study of NNS 
interaction in oral classes and networks by Kern (1995), reported that 
students interacting via Daedalus produced increased written output and 
number of turns. Moreover, the learners also produced a wider variety of 
discourse functions than in oral discussions. This study found that these 
results were due in part to the absence of distracters such as age, sex and 
status that can (in certain circumstances) act as barriers to learning in 
conventional classrooms. These early studies highlighted the potential of 
synchronous technologies in CALL. Further advances in network 
technologies were stimulated by the emergence of the Internet in the mid 
1990’s, and educators have been active in applying these new tools in the 
language classroom. 


4. Chat systems 

Synchronous chat tools are found on bulletin boards, servers and web 
sites in many parts of the Internet. These tools enable users to communicate 
via text in real time. 

A well-known example of a chat tool used extensively in education is 
Internet Relay Chat (IRC). As with most other chat tools, IRC contains many 
theme-based “channels”, that is virtual spaces where individuals may log-in 
anonymously using an alias and converse with other users through the use of 
text-based commands. In chat tools such as IRC users’ written output is 
displayed as soon as it is sent, enabling users to monitor their linguistic 
output. User interaction in IRC is fast paced and primarily social in nature 
(Werry, 1996). The capacity of chat tools such as IRC to promote learner- 
centered, cooperative learning has made these environments a focus of 
investigation in contemporary CALL research. Chat tools have been 
utilized in a number of CALL research projects. A study of learner 
interaction in a chat environment by Kitade (2000) found that students of 
Japanese as a second language engaged in self-correction and various TL 
repair behaviors. On this basis of these findings, Kitade concluded that 
these environments provide for collaborative learning and the production of 
comprehensible input. A further study by Blake (2000) found that in a 
task-based project involving chats between native and non-native speakers of 
Spanish, the learners engaged in the negotiation of meaning. Blake also 
reported that the data recording capacities of chat programs may be usefully 
employed in CALL, by providing learners with opportunities to enhance 
their metacognitive awareness through examination of chat files. Following 
on this research, a paper by Hudson and Bruckman (2002), reported that 
non-native speakers of French in a chat-based project engaged in 
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meaning-focused interaction. 
This study found that the anonymity afforded by interacting online reduced 
levels of inhibition during communication. The promising findings of these 
studies have also been replicated to a degree in other more recent projects. 
These studies focus on the investigation of learner interaction in unique 
forms of CMC environment. 


5. MOO environments 

Of the many forms of CMC tool now being applied in CALL text-based 
virtual worlds known as MOOs, bring a new dynamic to second language 
pedagogy and research (Peterson, 2001). These environments share a number 
of features found in other forms of synchronous CMC. MOOs are databases 
designed to support real time text-based communication between users and 
they consist of many virtual spaces known as rooms. Unlike other forms of 
text-based chat such as IRC, in MOOs rooms are user-created and are linked 
together to form extensive theme-based virtual worlds. Users access a MOO 
world by completing an alias-based login protocol. This provides anonymity 
and enables recording of the interaction. On completion of this procedure, 
users are free to traverse virtual space and communicate via a series of 
text-based commands. Recent versions of MOOs are browser-based and 
utilise hypertext and multi-media content. Studies of learner interaction in 
MOOs report that these environments offer a higher degree of permanence 
and community than is found in most other forms of network-based learning 
(Donaldson & Kotter, 1999; Kotter 2003; Von Der Emde et al., 2001). 
Several factors account for these findings. As MOOs evolved from MUDs 
(multiple-user domains) text-based role-playing games, they engender a 
higher degree of participation, commitment and sense of reciprocity on the 
part of users. Studies report that during participation in MOO-based 
learning these behaviors are reinforced due to the fact that in MOOs users 
can create and develop a unique individualized online persona (Peterson, 
2001;Von Der Emde et al., 2001,). Moreover, a hierarchy of user access 
coupled to the object-orientated nature of MOOs enables learners to actively 
participate in the creation and manipulation of objects within the MOO 
environment. For example, in MOOs users with the appropriate level of 
access can create and provide textual descriptions of their own rooms and 
add other types of content such as audio or video to the MOO. These 
activities create a sense of ownership and enhance the sense of immersion 
reported by learners who have taken part in MOO projects (Von Der Emde et 
al., 2001). A further unique feature of these virtual worlds is that they adopt 
many of the powerful learning metaphors of the traditional classroom. In 
MOOs learners can utilize virtual projectors, cameras, VCR’s and TV’s to 
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create, share and manipulate virtual objects. This aspect of MOOs makes 
them ideally suited to task-based learning focused on learner needs. 
Furthermore, in these flexible environments learners may participate in 
virtual lectures, seminars and presentations. The networked open-access 
nature of MOO virtual worlds facilitates collaborative learning projects with 
students based in other countries, thus providing opportunities to 
communicate with a wider range of interlocutors than may be found in many 
conventional classrooms. MOO virtual worlds offer a unique combination 
of features that result in a richer more structured learning environment than 
can be found in most other forms of CMC utilized in CALL (Shield, 2003). 
An example of a MOO virtual world is shown in the screen capture 
reproduced below: 


e098 @ The Cup-O MUD Client = 

@ e ff = 3 
Back Refresh = Home Print Mail 

U1) nttp://sohmooze hunter .cuny.edu/test.ntmi go 


al 
johvinus exits: North, Classrenm, EFI, and Hovine way 
= 
3 bbe eeeeawee baa eedeeeee deeb bd eweeeewes Adee eneweee errs eeereeeyy 
3.| |* Welcome Lo schmooze! 
@| \* If this is your first time here, please type 


* CLASSROOM to learn the basic soo commands . 
PUSS T TUTTE TTUCIISUTITTTT TITS TTT TT TTT Tere ere rit tree rire) 
You head north to the mall 

The Archway (southern end of campus) 


AsOs1H4 


[This is a moderate-sized archway built from limestone mined from a nearby 
quarry. Vievally, it frames the fountain that fronte the white facade of the 
\Library, which is in turn capped by a clocktower. The Library is lined on 
either side by the trees and brown brick buildings of the Great Mall. As you 
look up you notice the following words carved in the stone: 

TIME FLIES LIKE AN ARROW; FRUIT FLIES LIKE A BANANA. 


yruees 


You stop a few minutes to ponder their meaning. 


g <+-The campus clock Lower reads 2:18 a.m, EST=«> 
&| jbvious exits: north and saurh 
$ | |kinp [Cuest] looke at the map of cchMooze. 
— 2. —s POD 
= | Culture Center | | Library | Garden yard | 
s \ Bs | 
s / Student \ Class- | yard 
= | Union | North Mall | xooms | | 
‘ | | | | 
fonne- \ | 
‘ . D 
Central Mall ° ¥ | ° | Pr 
I | | | RJA 
west Mall Hast mall jimi s 
-Smunaseesess, T 
| Administration | South Mall Conference | u 
| at Center | Rr 
(-X-h\ l E 
ee | | ee he 
Bovine Way | Entrance Cate | X- ~ You are here. 
5 ° eee Archway 
MOOrrey's Bar \ / 
Applet star bed. 


Figure 1. The Schmooze University EFL MOO 
Although the application of MOOs in CALL is a relatively recent 


phenomenon, a number of preliminary research studies have been conducted. 
In a study describing a small-scale tandem learning project involving 
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students of German as a second language based in America and learners of 
English based in Germany, Donaldson and Kotter (1999) found that 
participation in MOO-based learning promoted collaborative learning, 
motivation and the development of learner autonomy. In this project the 
subjects rapidly took control of their learning and displayed a high degree of 
initiative and control when undertaking the tasks. These findings were 
corroborated in a more recent study conducted by Von Der Emde et al. 
(2001). This study focused on the interaction of two groups of undergraduate 
language students based in Germany and the United States. 

The researchers reported that learning in MOOs appeared to support 
learner-focused exploratory learning and peer teaching. A further significant 
aspect of the interaction between the two groups was a high degree of TL use, 
and the development of autonomous learning behaviors that fostered a sense 
of community between participants. A project reported in Kotter (2003) 
found that the NNSs subjects engaged in TL discourse while interacting with 
NSs in a tandem learning project. A positive finding of this study was the 
fact that the learners employed a variety of discourse management strategies 
including clarification, comprehension and confirmation checks. This 
indicates that interaction in MOOs may support the development of L2 
management skills. This view would appear to be supported by a key feature 
of the data. In examining the log files of the learners’ linguistic output, 
Kotter found evidence that the subjects engaged in the negotiation of 
meaning. This finding in combination with the results of the above studies 
highlights the potential beneficial effects of participation in MOO-based 
CALL projects. 


6. Immersive 3D virtual worlds 

Development work in CMC technologies has focused on creating a 
more personalized and immersive level of interaction than has been provided 
by conventional chat technologies. This research effort has been motivated 
by the desire to overcome some of the apparent limitations of communication 
in text-based forms of real time CMC, where the visual and prosodic cues 
that influence face-to-face communication are absent or reduced. Recent 
developments in network-based virtual reality (VR) technologies have raised 
the possibility of utilizing these advanced environments in CALL. One of 
the most promising of these environments is Active Worlds (http://www. 
activeworlds.com). This browser-based VR environment incorporates a chat 
tool in combination with a 3 D graphical interface that is designed to 
enhance immersion and a sense of co-presence among users. In addition, this 
platform is object-orientated, thus enabling users to create their own unique 
theme-based virtual worlds. In Active Worlds users can communicate and 
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navigate in real-time through the use of text and hypertext-based commands. 
The web site provides access to numerous user-created worlds accessible 
through a central gateway world known as Alphaworld. Many of these 
worlds are accessible to guest users. The web site also contains links to a 
help database. Active Worlds further attempts to provide users with a sense 
of presence within the environment through an innovation in VR 
technologies. The use of graphical embodiments of self, known as avatars. 
These graphical images can be designed by their owners (in the case of 
registered users) and are capable of replicating a number of the non-verbal 
cues that are absent in most chat environments. For example, avatars can 
make gestures (such as waving) and show emotion (including happiness) on 
screen in real time, thus enhancing communication. Moreover, these entities 
can also traverse virtual space and interact with other avatars, providing for a 
novel and motivational communication experience. A screen capture of the 
Active Worlds interface is reproduced below: 


eee @ Active Warlds Fducational Unvierse - Addition =) 
2 ff = = 

Refresh Home Aut Print Mail 
) http :/ edu.sctiveworlds.com/ go 


; @ 


AWEDU 


Sd OAR, 


The Educational Universe is an entire Active Worlds Universe dedicated to exploring the educational applications of 
the Active Worlds Technology. 


ALO1SIH 


yooudes \. yess 


ploy aBeg 


It is our hope that by making the Active Worlds Iechnology available to educational institutions, teachers, 
students, and individual programs at lower costs; educators will be able to explore new concepts, learning theories, 
creative curriculum design, and discover new paradigms in social learning 


Take a Virtual Tour of some of the worlds designed by the participants in the Active Worlds Educational Universe. 


Uick here to tind out more about being part of the Active Worlds Educational Universe. 


NEW! Click here to launch the 3d Classroom creator. 


Internet zone 


Figure 2. The Active Worlds interface 
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Although research into the use of advanced VR technologies in CALL 
remains at an early stage, a study of NS-NNS interaction in an Active Worlds 
environment conducted by Toyoda and Harrison reported some positive 
results (2002). These researchers found that despite technological problems, 
and difficulties in mastering user commands learners of Japanese as a second 
language adopted various strategies to overcome communication problems 
with native speakers while interacting in a purpose built Active Worlds 
environment. In this project, the learners engaged in TL discourse while 
utilizing their avatars. This study found evidence of the kind of negotiation 
of meaning that is held to be an important influence on the processes at work 
on SLA. While the findings of this study have yet to replicated, these 
preliminary results highlight the potential of immersive virtual worlds in 
CALL. 


7. Videoconferencing 

Videoconferencing technology offers a means to bring together groups 
of learners for collaborative real time interaction. This technology offers the 
additional advantages of providing enhanced visual feedback and online 
collaborative work spaces (shared whiteboards). Early development work in 
the use of video-based tools in language education was hampered by 
bandwidth problems. Recent improvements in the data transfer capacity of 
the Internet and developments in software have stimulated a number of 
video-based CALL projects and the results have been encouraging (see for 
example Bucket et al., 1999). Although desktop videoconferencing has yet to 
become a viable tool for large groups, research suggests that this form of 
synchronous communication has considerable potential as a language 
learning tool (Kinginger, 1998; O’Dowd, 2000). For example, a small-scale 
study conducted by Buckett & Stringer (1997) involving an inter-university 
language project reported that videoconferencing technology facilitated 
collaborative student centered interaction. Furthermore, this research 
reported positive feedback from the participants. A more recent study conducted 
by Wang (2004) involving Chinese language learners reported similar results 
and claimed (on the basis of questionnaire data) that the subjects found 
interacting via video improved a number of language skills. The learners 
reported that interacting via video enabled them to improve their listening 
and discourse management skills. The subjects claimed that taking part in 
video sessions enabled them to build confidence and improve their 
pronunciation. Although research into the use of videoconferencing in CALL 
remains at the developmental stage as the technology becomes more 
accessible, researchers will attempt to exploit the learning opportunities 
provided by this form of CMC. 
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Table 2._ Benefits of synchronous communication in CALL 


Opportunities for TL-based interaction (comprehensible input and modified output) 


Enhanced feedback 


Cooperative learning 


Monitoring 


Increased linguistic output compared to conventional classrooms 


Negotiation of meaning 


Reduced inhibition 


Increased motivation 


Development of learner autonomy 


Enhanced social and discourse management skills 


Increased fluency 


Avatars enhance the sense of co-presence and engagement in virtual worlds 


8. Network-based environments in CALL: Directions for future research 
The implementation of network-based communication tools in CALL 
raises a number of research issues. The results of existing studies involving 
learner interaction in CMC highlight new areas that require investigation. 
Given the emergent nature of the discourse produced in CMC-based 
interaction there remains a need to identify the significant linguistic and 
interactional features of this form of communication. At present, as Ortega 
(1997) has noted, the nature of CMC discourse remains unclear. 
Some researchers view the discourse produced in CMC as a unique form 
distinct from speaking and writing (Werry, 1996), while others perceive a 
blend of these forms (Herring, 2001). The limited extent of the current 
literature highlights the need for further research efforts in this area in order 
to clarify the status of the learner discourse produced in both synchronous 
and asynchronous CMC. This effort would facilitate comparisons with 
studies based on conventional classroom interaction. At present, researchers 
have a limited understanding of how the typed text-based nature of the 
communication in most forms of CMC influences the interaction in 
network-based CALL projects. 
Some researchers have argued (Kern, 1995; Swaffar, 1998) that online 
interaction promotes learning by removing distracters such as age and status 
while at the same time affording anonymity that encourages participation. 
There remains a pressing need to conduct both qualitative and quantitative 
studies in order to investigate these claims and establish how interaction 
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management works for non-native speakers in network-based online 
environments. The results of such studies may inform the development of 
pedagogy in this area of CALL. Moreover, there is a need for research that 
accounts for the effects of variables such as proficiency levels, computer 
skills and learning styles on learner interlanguage development in network- 
based CALL. Investigation of the above variables offers the possibility of 
greatly enhancing the benefits of participation in network-based learning. A 
further issue of importance to advances in the area of CMC-based CALL is 
the nature and extent of learner negotiation. As negotiation of meaning is 
viewed as an important influence on the processes at work in SLA (Gass, 
1997; Pica, 1994; Varonis & Gass, 1985) future research activities in 
network-based CALL are needed in order to focus on investigating the 
degree to which negotiation takes place in CMC environments and the ways 
this behavior can be fostered. In this context, studies that investigate the 
design and role of online learning tasks are of particular relevance. In the 
future, researchers may also consider the effectiveness of participation in the 
various types of CMG, in order to identify the forms of synchronous and 
synchronous communication that are best suited to the support of different 
aspects of learners’ interlanguage development. For example, current 
research suggests that participation in real time interaction results in greater 
L2 output but a higher incidence of errors than in asynchronous forms of 
interaction (Warschauer, 1998). Furthermore, studies show that synchronous 
interaction contributes to the development of fluency, as learners are pushed 
to produce output by the communicative need engendered by the real time 
nature of the interaction (Kitade, 2000; Lee, 2001;2002). In contrast, research 
indicates that participation in asynchronous CALL activities promotes the 
development of accuracy (Gonzalez-Bueno, 1998; Sotillo, 2000) as the learners 
have more time to reflect on their TL output. These results require 
investigation by means of large-scale longitudinal studies. 

As network-based tools offer opportunities for communication with 
native speakers and other learners in diverse locations, the investigation of 
how cross-cultural knowledge and skills can be enhanced through 
participation in international projects remains an area of considerable scope 
for future research in CALL. A study of the web-based interaction of French 
learners (based in America) and English learners (located in France) reported 
in Furstenberg et al., (2001) demonstrates that this form of interaction can 
facilitate and enhance understanding of the TL culture. The researchers in 
this study used a web site and online forum to enable learners to exchange 
opinions on various forms of TL content drawn for both France and the 
United States. The content discussed included films, texts and online news 
media. The authors of this research conducted post-study questionnaires; 
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these showed that participation in the project enhanced the subjects’ 
knowledge of the cultural differences between America and France and also 
improved their understanding of the TL culture. These encouraging results 
emphasize the need for research that investigates the impact of participation 
in international network-based CALL projects on the development of 
learners cross-cultural knowledge and understanding of the TL culture. 

Much of the current debate regarding the future of CALL is based on 
two conflicting views regarding the most effective way to proceed. 
Interactionist researchers (for example Chapelle, 1997) take the view that the 
most fruitful way forward in CALL is to base research efforts on the findings 
of SLA research. This school of thought argues that only this approach 
provides a principled basis for development in CALL. In contrast, opponents 
of this view (Harrington & Levy, 2001; Salaberry, 1999) claim that CMC 
introduces a radically new dynamic in CALL, that requires the consideration 
of a wider set of research variables than is found in traditional interactionist 
research. In short, a new broader approach that encompasses recent 
conceptions of the technologically influenced social nature of online 
cognition. While a consensus on this debate has yet to emerge, future 
research projects in CALL require a perspective that accounts for the many 
variables that influence interlanguage development in the online domain. 
Part of this research effort requires a focus on the impact of the various 
forms of CMC on language teaching pedagogy. Many early studies on the 
application of CMC environments in CALL highlighted the fact that the 
anonymity afford by learning on-line, coupled to the egalitarian nature of 
electronic discourse, had the effect of radically impacting on traditional 
conceptions of teacher and learning (Peterson, 1997; Warschauer et al., 1996). 
Many proponents of the utilization of CMC in CALL argue that 
network-based learning empowers learners disadvantaged in conventional 
classrooms and at the same time supports the development of discourse 
management skills and learner autonomy (Chun, 1994; Donladson & Kotter, 
1999;Von der Emde at al., 2001). Moreover, from this viewpoint 
participation in network-based learning engineers a major shift in classroom 
dynamics from the traditional teacher-led view of learning toward a 
learner-centered model. The role of the teacher in the on-line classroom is 
therefore transformed to that of facilitator (Warschauer et al., 1996). This 
positive view of online learning has developed in the context of the 
emergence of new conceptions of SLA that stress the role of social 
interaction in learning. What remains to be developed however, is a 
perspective that informs CALL research on not only why learning takes 
place but also how language acquisition occurs and may be supported in 
CMC. This major undertaking requires a range of both empirical and 
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qualitative studies that examine the many cognitive and affective factors that 
may promote learning in CMC. 


9. Conclusion 

The application of synchronous and asynchronous CMC environments 
in language education represents an important new development in the field 
of CALL. 
The increasing implementation of CMC technologies has introduced a new 
dynamic to CALL research and pedagogy that is rich in possibilities. The 
existing literature indicates that these tools when combined with task-based 
pedagogies, create the conditions in which language acquisition processes 
are fostered. Moreover, the data recording capacities of computers provide 
researchers with a unique opportunity to assess the nature of learner 
communication in networked classrooms. The study of the discourse 
produced in CMC environments provides a new perspective on learners’ 
interlanguage development (Blake, 2000). In the future, researchers will 
continue the task of exploring this new form of communication with a view 
to increasing our understanding of the many complex processes at work in 
networked learning. There remains a need for research studies that adopt 
qualitative and quantitative approaches, in order to explore how the various 
forms of synchronous and asynchronous environment utilized in CALL 
influence learner interaction and second language learning. This research 
effort may also focus on the development of new pedagogical approaches 
that capitalize on the advantages offered by CMC technologies. In a wider 
context, the rise of CMC-based CALL highlights the need to more fully 
explore how technology, pedagogy and learning interact in the networked 
classrooms of the 21“ century. 
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